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As  the  field  of  robotics  continues  to  strive  forward,  the  need  for  artificial  tactile 
sensing  becomes  increasingly  evident.  Real-time,  local  processing  of  tactile  sensory 
data  also  becomes  a  crucial  issue  in  most  applications  of  tactile  sensing.  In  this 
thesis  it  is  shown  that  analog  neural  networks  provide  an  elegant  solution  to  some 
of  the  problems  of  low  level  tactile  data  processing.  We  consider  the  particular 
problem  of  ‘deblurring’  strain  data  from  an  array  of  tactile  sensors.  It  is  shown 
that  the  inverse  problem  of  deblurring  strain  measurements  to  recover  the  surface 
stress  over  a  region  of  contact  is  ill-posed  in  the  sense  defined  by  Hadamard.  This 
problem  is  further  complicated  by  the  corruption  of  sensor  data  by  noise.  We  show 
that  the  techniques  of  ‘regularization’  may  be  used  to  introduce  prior  knowledge 
of  the  solution  space  into  the  solutions  in  order  to  transform  the  problem  to  one 
which  is  well-posed  and  less  sensitive  to  noise.  The  particular  regularizer  chosen 
for  the  recovery  of  normal  stress  distributions  is  of  the  functional  form  of  Shannon 
entropy.  Formulation  of  the  inverse  problem  so  as  to  regularize  the  solutions  results 
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in  a  variational  principle  whichmust  be  solved  in  order  to  recover  the  surface  stress. 
An  analog  neural  network  which  provides  the  desired  solutions  to  the  variational 
principle  as  a  course  of  natural  time  evolution  of  the  circuit  dynamics  is  proposed 
as  a  solution  to  the  requirements  for  fast,  local  processing  in  tactile  sensing.  We 
discuss  performance  of  the  network  in  the  presence  of  noise  based  upon  computer 
simulations.  We  also  demonstrate,  by  means  of  a  breadboard  prototype  of  the  net¬ 
work,  the  speed  of  computation  achievable  by  such  a  network.  An  integrated  circuit 
implementation  of  the  proposed  network  has  been  completed  and  the  requirements 
of  such  implementations  is  discussed. 
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CHAPTER 

ONE 


INTRODUCTION 


.  Recent  years  have  seen  a  significant  increase  in  the  complexity  of  tasks  per¬ 
formed  by  robotic  manipulators.  As  the  complexity  of  these  tasks  continues  to 
grow,  the  need  for  automated  tactile  sensing  becomes  increasingly  evident.  The 
term  tactile  sensing,  as  used  here,  refers  to  the  continuous  sensing  of  forces  over 
regions  of  contact.  Included  in  this  definition  of  tactile  sensing  are  the  more  rudi¬ 
mentary  operations  of  binary  contact  sensing  and  pressure  sensing. 

Since  the  specific  requirements  of  robot  tactile  sensing  have  not  as  yet  been 
clearly  defined,  it  is  often  useful  to  view  tactile  sensing  in  humans  as  a  model  for 
artificial  tactile  sensing.  Tactile  sensing  in  humans  is  a  dynamic  process  in  which 
dexterous  hands  are  used  in  conjunction  with  dense  arrays  of  subcutaneous  sensors 
to  extract  information  about  the  contact  which  is  necessary  for  feature  identification 
and  formulation  of  manipulation  strategies  (see  [1]).  An  ‘ideal’  artificial  tactile 
sensing  system  should  strive  to  perform  a  similar  function. 

The  problem  of  tactile  sensing  can  be  hierarchically  separated  into  three  stages. 

(a)  At  the  bottom  of  the  hierarchy,  there  are  the  device  level  problems  of  de¬ 
signing  a  tactile  sensory  device,  and  of  designing  a  dextrous  manipulator  to 
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be  equipped  with  such  sensors.  According  to  a  survey  of  robotic  researchers 
and  manufacturers  (see  [2]),  tactile  sensors  should  be  distributed  in  arrays  on 
thin,  flexible,  compliant  substrates.  Also,  since  tactile  sensing  is  based  upon 
physical  contact,  it  is  required  that  the  entire  structure  comprising  the  tactile 
sensors  be  mechanically  durable  and  robust  against  environmental  variations. 
It  was  also  sugested  in  [2]  that  a  fair  amount  of  preprocessing  of  the  sensory 
data  be  performed  at  the  sensor  level  so  as  to  limit  the  quantity  of  data  to 
be  transferred  to  the  central  processing  unit. 

(b)  Given  data  from  the  tactile  sensors,  the  second  stage  of  the  hierarchy  is  con¬ 
cerned  with  low-level  processing  and  extraction  of  information.  For  example, 
often  raw  data  provided  by  the  sensors  are  not  measurements  of  contact 
forces,  but  are  related  in  some  manner  to  the  stress  profile  over  the  region 
of  contact.  As  will  be  shown  in  Chapter  2,  an  inverse  problem  arises  which 
must  be  solved  to  determine  the  stress  profile.  Detection  of  edges  and  identi¬ 
fication  of  conditions  such  as  slippage  should  also  be  performed  at  this  level 
of  the  hierarchy. 

(c)  The  top  most  level  in  this  hierarchical  picture  of  tactile  sensing  is  the  level 
at  which  decisions  are  made,  based  upon  information  provided  by  the  two 
previous  stages.  Formulation  of  manipulation  strategies  is  performed  at  this 
level.  For  example,  if  a  condition  of  slippage  has  been  recognized,  a  manip¬ 
ulation  strategy  should  be  formulated  so  as  to  alleviate  this  condition.  Also 
at  this  level  of  processing,  decisions  are  made  regarding  the  identification  of 
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a  grasped  object  based  upon  information  about  texture,  shape  (which  could 
have  been  determined  from  knowledge  of  edges),  and  material  (which  may 
possibly  be  determined  from  thermal  conductivity). 

It  is  among  the  goals  of  this  thesis  to  take  a  step  towards  integrating  the  two  lowest 
levels  of  this  hierarchy. 

In  the  case  of  human  tactile  sensing,  the  total  number  of  sensor  cells  in  a  finger¬ 
tip  area  of  20  x  30mm  can  exceed  60,000.  In  [2]  it  is  suggested  that  a  typical  array 
of  10  x  10  sensor  elements  per  square  inch  should  suffice  for  many  applications  of 
tactile  sensing.  Considering  only  this  reduced  number  of  100  sensors,  the  quantity 
of  data  that  must  be  transferred  to  the  central  processing  unit,  if  all  processing 
is  to  be  done  there,  is  still  prohibitive.  It  would  not  in  general  be  acceptable  to 
have  a  cable  running  from  the  hand  to  the  CPU  consisting  of  several  hundred  con¬ 
ductors.  In  addition  to  simply  conveying  the  information  to  the  central  processing 
unit,  there  is  also  the  actual  processing  task  which  would  demand  much  of  the 
available  processing  time.  It  is  clear  that  a  great  deal  of  preprocessing  should  be 
performed  at  or  near  the  sensor  array  itself.  In  doing  so,  a  simpler  representation 
of  the  data  may  be  produced  which  could  then  be  conveyed  to  the  central  process¬ 
ing  unit.  Demands  upon  the  processing  power  of  the  CPU  would  then  be  limited 
to,  at  most,  only  the  top  level  in  the  tactile  sensing  hierarchy.  In  humans,  useful 
information  is  often  acquired  through  active  manipulation.  A  grasped  object  is 
often  scraped,  rolled,  mutilated  to  determine  its  properties  ([2]).  Therefore  it  is 
reasonable  to  assume  that  in  robots,  as  in  humans,  static  contact  analysis  together 
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with  dynamical  analysis  may  prove  to  be  useful.  This  dynamical  aspect  of  tactile 
sensing,  together  with  the  fact  that  tactile  sensing  is  generally  to  be  applied  in  a 
‘real  time’  environment,  implies  the  need  for  feist  processing  of  tactile  sensory  data. 

In  this  thesis  we  discuss  a  framework  within  which  at  least  some  of  the  second 
level  of  tactile  information  processing  may  be  performed.  The  approach  taken  is 
designed  to  meet  the  requirement  of  local  fingertip  processing  as  well  as  the  demand 
for  fast  computation.  The  problem  considered  in  this  thesis  is  the  determination 
of  surface  stress  from  an  array  of  sensors  which  provides  measurements  of  strain 
induced  in  a  elastic  medium  by  contact  at  the  boundary.  Since  it  can  be  assumed 
that  the  data  provided  by  the  sensors  is  corrupted  by  noise  it  is  necessary  to  consider 
this  additional  aspect  of  the  problem  as  well.  In  order  to  meet  the  requirements  of 
fast  local  fingertip  processing,  the  paradigm  of  neural  networks  is  considered. 

Inspired  by  the  work  of  a  of  neurophysiologists  and  psychologists  and  other 
researchers  on  the  mechanisms  of  computation,  learning  and  memory  in  biological 
systems,  artificial  neural  networks  are  an  attempt  to  reproduce  the  computational 
efficiency  observed  in  the  nervous  system.  An  artificial  neural  network  may  be  de¬ 
fined  as  a  highly  interconnected  network  of  simple  processing  units.  The  processing 
units  themselves  are  rarely  more  than  simple  amplifiers  (usually  nonlinear  ampli¬ 
fiers  such  as  those  with  sigmoidal  characteristics  are  used),  yet  neural  networks 
have  in  many  instances  demonstrated  an  ability  to  solve  complex  problems.  The 
computational  power  of  artificial  neural  networks  is  embedded  in  the  nature  of  con¬ 
nectivities  between  the  processing  units  (or  neurons)  of  which  they  are  composed. 
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Neural  networks1  are  usually  regarded  as  being  comprised  of  layers  of  neurons.  In 
general  connections  in  a  neural  network  are  defined  between  neurons  in  different 
layers  as  well  as  between  neurons  of  a  single  layer.  Networks  in  which  connections 
occur  from  the  output  to  the  input  of  a  single  neuron  (feedback  connections)  are 
termed  recurrent  networks.  Associated  with  a  connection  (also  called  a  synapse ) 
between  two  neurons,  say  neuron  i  and  neuron  j,  is  a  number  called  the  weight 
of  the  connection  (sometimes  referred  to  as  the  synaptic  weight)  between  neuron  i 
and  neuron  j,  which  determines  the  effect  that  the  output  of  neuron  i  has  upon  the 
activity  of  neuron  j.  For  example,  if  the  output  of  neuron  i  is  Vi  then  the  input  to 
neuron  j  due  to  the  connection  of  neuron  i  to  neuron  j  is  given  by  It  is  the 

connectivity  profile  (distribution  of  connection  weights)  which  determines  the  com¬ 
putational  task  performed  by  any  given  network.  A  layer  of  a  neural  network  which 
accepts  external  inputs  is  called  an  input  layer  for  the  network.  Similarly  a  layer 
whose  outputs  are  considered  as  outputs  of  the  network  as  a  whole  is  termed  an 
output  layer.  Since  both  the  input  and  output  layers  can  consist  of  many  neurons, 
the  inputs  and  outputs  of  a  neural  network  are  vectors  (in  finite  dimensional  vector 
spaces).  Computation  in  a  neural  network  is  either  instantaneous  as  in  feedforward 
networks  or  involves  the  time  evolution  of  the  network  as  a  dynamical  system  (as 
in  recurrent  networks).  In  either  case  the  the  network  operates  on  vectors  and  the 
computation  occurs  in  a  parallel  asynchronous  fashion. 

There  is  evidence  that  computation  in  the  nervous  system  is  mostly  performed 

^he  term  neural  networks  will  be  used  in  reference  to  artificial  neur.il  networks  unless  other¬ 
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in  analog.  The  paradigm  of  neural  networks  also  provides  a  formalism  for  the  ana¬ 
log  hardware  implementation  of  inherently  parallel  algorithms.  Biological  neurons 
are  often  modeled  as  integrators  (with  the  sum  of  all  inputs  to  the  neuron  as  their 
inputs)  followed  by  output  functions.  In  terms  of  analog  circuits  this  corresponds 
to  a  simple  RC  integrator  circuit  followed  by  an  amplifier  with  the  desired  char¬ 
acteristics.  Connections  between  neurons  can  be  implemented  as  resistors.  If  the 
output  of  neuron  i  is  a  voltage  which  is  connected  to  the  input  of  neuron  j  through 
a  resistor  Rij  then  Ohm’s  Law  dictates  that  the  current  input  to  neuron  j  due 
to  the  output  Vi  of  neuron  i  is  RijVi.  Hence  the  synaptic  weight  Wij  is  simply 
1  /Rij.  Hence  having  arrived  at  a  neural  network  model  for  computation,  imple¬ 
mentation  of  the  network  as  an  electrical  circuit  is  a  natural  extension.  Advances 
in  microelectronics  have  provided  the  technology  required  for  implementations  of 
‘small’  neural  networks.  VLSI  implementations  of  networks  consisting  of  hundreds, 
thousands,  even  tens  of  thousands  of  neurons,  should  be  possible  with  further  ad¬ 
vances  in  microelectronic  technologies,  but  as  yet  are  unrealizable  due  mostly  to 
limitations  in  integrated  circuit  density  and  physical  size.  Analog  computation  is 
limited  in  speed  only  by  the  propagation  time  of  electrons  in  the  conductors  and  by 
any  dynamics  induced  by  capacitors  and  inductors  (parasitic  or  otherwise).  Thus 
analog  computation  is  inherently  fast.  Integrated  circuit  implementation  of  neural 
networks  to  perform  the  low  level  processing  tasks  of  the  second  level  of  hierar¬ 
chy  in  tactile  sensing  would  provide  fast  computation  which  can  be  performed  in 
close  physical  proximity  to  the  sensors.  The  resultant  of  processing  performed  by 
such  a  network  can  then  be  transmitted  to  the  central  processing  unit  for  higher 
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level  interpretation  (and  decision  making  based  upon  the  interpretation)  using  rel¬ 
atively  few  conductors.  Such  an  approach  would  also  result  in  lower  computational 
demand  upon  the  CPU. 

In  Chapter  2  a  particular  inverse  problem  which  arises  in  the  context  of  tactile 
sensing  is  introduced.  It  is  shown  that  inverse  problems  of  this  nature  are  ‘ill-posed’ 
and  hence  solutions  must  be  obtained  using  special  techniques.  The  inverse  problem 
is  formulated  as  a  variational  principle  and  ‘regularization’  is  used  to  compensate 
for  the  ill-posedness  of  the  problem  and  provide  for  reliable  computation  in  the 
presence  of  sensor  noise. 

In  Chapter  3  some  aspects  of  nonlinear  RC  electrical  networks  are  discussed. 
It  is  shown  that  under  certain  conditions  it  is  possible  to  guarantee  stability  and 
explicitly  determine  a  strict  Lyapunov  function  for  such  a  network.  Since  such 
a  Lyapunov  function  is  minimized  in  the  course  of  natural  time  evolution  of  the 
network,  steady  state  outputs  of  the  network  correspond  to  the  solutions  of  a 
minimization  problem  (defined  by  the  Lyapunov  function). 

An  analog  neural  network  to  solve  the  inverse  problem  of  tactile  sensing  is 
described  in  Chapter  4.  Using  the  results  of  Chapter  3,  an  energy  function  for 
the  proposed  network  is  determined  and  shown  to  be  equivalent  to  the  variational 
principle  formulated  in  Chapter  2  for  regularized  solution  of  the  inverse  problem. 
Computer  simulations  of  the  proposed  network  are  presented  in  order  to  evaluate 
performance  of  the  network  in  the  presence  of  noise. 

Chapter  5  is  concerned  with  the  analog  electrical  circuit  implementation  of  the 
network  described  in  Chapter  4.  A  prototype  breadboard  model  of  the  network  as 
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well  as  a  preliminary  attempt  at  integrated  circuit  implementation  of  the  network 
are  discussed. 


8 


CHAPTER 

TWO 


THE  INVERSE  PROBLEM  IN  TACTILE  SENSING 


In  this  chapter  we  examine  the  fundamental  inverse  problem  which  arises  in 
the  context  of  robot  tactile  perception.  Inverse  problems  of  a  similar  nature  also 
arise  in  the  field  of  computational  vision  in  early  vision  problems  which  have  often 
been  designated  as  inverse  optics  (see  Poggio  and  Koch  [3]).  Hadamard  [4]  in  1923 
defined  a  problem  to  be  well-posed  if, 

1.  The  solution  exists. 

2.  The  solution  is  unique. 

3.  The  solution  depends  continuously  on  the  initial  data. 

If  any  of  the  above  three  criteria  are  not  satisfied,  the  problem  is  said  to  be  ill- 
posed.  Most  inverse  problems  which  arise  in  physical  settings  (such  as  those  in 
early  vision)  are  ill-posed  and  thus  require  special  techniques  to  solve. 

In  Section  2.1  and  2.2,  models  of  the  transduction  process  involved  in  tactile 
sensing  are  presented  and  the  inverse  problem  is  defined.  In  Section  2.4  it  is  shown 
that  since  the  integral  (convolution)  operator  describing  the  transduction  process 
is  compact,  linear  and  with  an  infinite  dimensional  domain,  the  inverse  operator  is 
unbounded  and  thus  the  problem  is  ill-posed.  Since  data  from  an  array  of  tactile 
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sensors  is  discrete,  in  Section  2.4  the  process  of  approximating  the  integral  operator 
by  a  finite  matrix  is  justified. 

2.1  Tactile  Sensors  and  Compliant  Contact 

Numerous  tactile  sensors  have  been  designed  and  fabricated  for  use  with  robotic 
end-effectors.  All  such  sensors  have  been  based  on  the  deformation  of  materials 
by  contact  forces  and  the  measurement  and  interpretation  of  the  deformation  to 
determine  the  forces  inducing  it.  Examples  of  some  approaches  to  tactile  sensor 
design  are  to  be  found  in  [5],  [6],  [7],  [8],  and  [9].  In  typical  approaches  to  tactile 
sensor  design,  the  transduction  process  is  effected  by  materials  such  as  piezo-electric 
polymers  [5]  and  crystals,  conductive  rubber  [5],  and  piezoresistive  materials  such 
as  silicon  [10].  In  [9]  a  tactile  sensor  is  described  which  is  designed  to  operate  based 
upon  changes  in  optical  characteristics  of  a  material  at  boundaries  (total  internal 
reflection).  Such  an  approach  has  the  advantage  of  very  high  spatial  resolution  since 
it  is  not  necessary  to  construct  arrays  of  discrete  sensors.  Another  novel  approach  to 
tactile  sensor  design  is  described  in  [11]  where  transduction  is  based  upon  variations 
in  magnetic  fields  which  are  measured  by  a  VLSI  array  of  Hall-effect  sensors.  In 
most  approaches  to  tactile  sensing,  there  arises  an  inverse  problem,  namely,  given 
data  from  the  sensors,  determine  the  force  profile  at  the  contact  region.  For  the 
purpose  of  providing  a  concrete  example,  we  concern  ourselves  here  with  a  silicon 
based  piezoresistive  triaxial  tactile  sensor  which  has  been  designed  and  fabricated 
at  the  Naval  Research  Laboratories  in  Washington  D.C.  and  described  in  [10]. 

The  sensor  is  constructed  as  a  micromachined  silicon  mesa  surrounded  by  a  thin 
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diaphragm  (see  Figure  2.1).  On  the  front  side  of  the  sensor,  beneath  the  mesa,  an 
array  of  resistors  is  formed  in  the  silicon  by  a  diffusion  process.  An  array  of  these 
sensors  is  constructed  on  a  single  die  and  then  bonded  and  packaged  to  be  mounted 
on  a  robot  fingertip.  As  force  is  applied  to  the  mesa,  the  diaphragm  is  deformed 
which  in  turn  deforms  the  diffused  resistors  causing  a  change  in  their  resistances. 
The  changes  in  resistances  are  measured  to  determine  the  magnitude  and  direction 
of  the  applied  force.  Sensitivity  of  the  tactile  sensor  is  determined  by  the  magnitude 
of  strain  induced  in  the  resistors  by  a  given  force.  In  order  to  increase  sensitivity  of 
the  sensors,  it  is  necessary  to  decrease  the  thickness  of  the  diaphragm  and  thereby 
increase  the  likelihood  of  its  rupture  due  to  excessive  force.  To  attenuate  large 
forces  and  provide  damping  against  large  impulsive  impact  forces,  the  array  of 
sensors  is  covered  by  a  layer  of  compliant  material  such  as  rubber  or  polyimide. 

The  use  of  a  compliant  layer  for  fingertip  contact  serves  three  beneficial  func¬ 
tions. 

(1) As  mentioned  previously  the  compliant  layer  serves  to  protect  the  sensors 
from  damage  due  to  contact  forces. 

(2)  In  order  to  facilitate  a  stable  grasp  it  is  desirable  to  enhance  friction  at  the 
contact  surfaces.  For  given  material  at  the  contact  surface,  this  can  be  achieved 
by  maximizing  contact  area.  A  rigid  material  in  contact  with  an  irregular  or 
bumpy  surface,  contacts  the  surface  at  a  few  discrete  points  only  (see  Figure  2.2(a)) 
whereas  a  compliant  material  can  conform  to  the  surface  and  thus  maximize  contact 
area  (see  Figure  2.2(b)).  Since  the  tactile  sensing  array  provides  the  desired  contact 
surface  on  the  robot  finger,  it  is  beneficial  to  provide  compliant  contact  atop  the 
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1 

(b) 

Figure  2.1:  (a)  Schematic  top  and  crossectional  view  of  a  single  piezoresistive  tactile 
sensing  element,  (b)  Application  of  forces  to  the  mesa  deforms  the  diaphragm, 
causing  changes  in  the  resistance  of  the  piezo  resistors. 
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Figure  2.2:  (a)  Rigid  contact  with  an  irregular  surface  results  in  contact  at  a  number 
of  discrete  points  only,  (b)  Compliant  contact  with  an  irregular  surface 

array  of  sensors. 

(3)The  third  function  of  the  compliant  layer  is  due  to  the  resultant  ‘blurring’ 
of  the  contact  force  distribution  which  causes  information  about  the  entire  stress 
distribution  to  be  passed  to  each  individual  sensor  element.  In  the  absence  of  such 
blurring,  determination  of  the  applied  stress  between  sample  points  would  not  be 
possible. 

We  are  now  presented  with  the  following  scenario  (see  Figure  2.3(a)).  Contact 
is  established  with  an  object  resulting  in  a  stress  distribution  at  the  surface  of 
the  compliant  pad.  Stress  at  the  surface  induces  strain  in  the  compliant  material 
which  is  sampled  by  the  sensors  at  a  depth  x  beneath  the  surface.  This  leads  to 
the  following  definition  in  words. 

Definition  2.1  Given  samples  of  a  strain  distribution,  measured  by  sensors  at  a 
given  depth  beneath  the  surface  of  a  compliant  layer,  the  inverse  problem  of  tactile 
sensing  refers  to  the  problem  of  determining  the  surface  stress  distribution  which 


induced  the  measured  strain. 


Normal  Stress  f 


(a)  UlllllllUllil 


x 


Figure  2.3:  (a)  Stress  applied  at  the  boundary  of  elastic  layer  with  strain  sensors 
beneath  surface,  (b)  Elastic  layer  extends  from  ymin  to  ymax  with  sensors  at  depth 
x  beneath  the  surface  at  intervals  of  Ay. 
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2.2  Modeling  the  Transduction  Process 


For  sake  of  simplicity  in  the  current  discussion,  two  assumptions  are  made,  (i)  The 
general  three  dimensional  problem  is  reduced  to  a  two  dimensional  setting,  i.e.  we 
consider  a  linear  array  of  sensors  with  planar  stress  applied  to  the  pad.  (ii)  It  is 
also  assumed  that  the  compliant  layer  is  actually  a  homogeneous,  isotropic,  linear, 
elastic  half-space. 

To  understand  the  inverse  problem,  a  model  for  the  forward  transduction  pro¬ 
cess  is  first  developed  i.e.  a  model  describing  the  relationship  between  stress  applied 
to  the  compliant  layer  and  the  strain  induced  at  a  depth  x  beneath  the  surface. 
From  the  theory  of  elasticity  (see  Timoshenko  and  Goodier  [11])  two  relationships 
(for  detailed  derivations  see  Yang  [12])  can  be  derived.. 

Strain  at  a  depth  x  beneath  the  surface  of  the  compliant  material  due  to  normal 
stress  at  the  surface  is  given  by 

/oo 

K(y,yo)qv{yo)dy0  (2.2.1) 

-00 


where,  px{-)  is  the  strain  at  depth  x,  qv(-)  is  the  surface  stress,  and  &£(•,•)  is  the 
convolution  kernel  relating  the  two  given  by, 


Kfayo) 


2x((l  -  u)2x2  -  +  l)(y  -  yp)2) 

kE(x2  +  (y  -  y0)2)2 


(2.2.2) 


where,  v  is  Poisson’s  ratio  for  the  material  and  E  is  the  modulus  of  elasticity. 
Strain  due  to  tangential  stress  applied  at  the  surface  is  given  by, 


Px 


K{y,yo)qt{yo)dy0, 


(2.2.3) 


15 


where,  is  the  tangential  surface  stress  and 


yo) 


2 (y  -  yo)((l  -  v)2x'1  -  v(v  +  1  ){y  -  y0 )2) 
kE(x 2  +  (y  -  ?/o)2)2 


(2.2.4) 


For  the  sake  of  simplicity  we  only  consider  the  case  of  normal  stress  throughout 
this  thesis  since  the  case  with  tangential  stress  is  obviously  analogous1 . 

Since  measurements  of  strain  are  made  at  a  discrete  number  of  points  only, 
equation  (2.2.1)  must  be  discretized. 

Assume  that  the  sensors  are  distributed  uniformly  (equal  spacing)  beneath  the 
surface  of  the  compliant  layer.  Let  A py  be  the  distance  between  points  at  which 
strain  is  sampled.  So, 

Umax  Umin 

TV-  1 

(see  Figure  2.3),  where  N  is  the  total  number  of  sensors.  Let  A qy  be  the  distance 
between  points  where  the  stress  profile  is  to  be  reconstructed.  Although  it  is  not 
necessary  for  A py  and  A qy  to  be  equal,  for  the  current  discussion  we  let  A py  — 
A qy  =  A y.  To  obtain  a  discretized  version  of  equation  (2.2.1),  let  ex  be  the  vector 
of  strain  samples  i.e., 

€r  i  •  •  •  i  ^x^)  • 


Therefore, 


—  Px(,ymin  T  2^!/)  I  —  1,  .  .  .  ,  N. 


Similarly,  let  /„  =  (/„, , . . . ,  fVN)T  be  the  vector  obtained  from  the  stress  distribution 
lrThe  analogy  referred  to  here  does  not  extend  to  the  choice  of  regularizer  for  the  case  of 
tangential  stress. 


16 


as, 

f v%  —  4“  lA'lj')  2  —  1,  •  •  •  ,  N . 

The  convolution  kernel  &”(•,  •)  can  be  discretized  to  form  the  matrix  T  =  {7q}  by 
letting 

Tij  =  kvx{yi,yj ), 

where  jq  =  ymin-\-iAy  for  i  =  1, . . . ,  N.  Hence  the  discretization  of  equation  (2.2.1) 
results  in, 

ex  =  T  ■  fv.  (2.2.5) 

The  discretized  inverse  problem  can  now  be  stated  more  precisely  as  being  the 
problem  of  determining  /„  given  ex  and  T.  It  will  be  shown  in  the  next  section 
that  this  inverse  problem  is  ill-posed  the  sense  defined  by  Hadamard  and  that  in 
situations  where  the  strain  measurements  ex  are  corrupted  by  noise,  solutions  to 
the  problem  are  not  ‘well  behaved’. 

Returning  to  the  third  desirable  feature  of  compliant  contact,  we  observe  that 
if  we  let  T  be  a  non-square  matrix  then  in  solving  the  inverse  problem  we  are 
attempting  to  reconstruct  the  surface  stress  at  points  along  the  surface  other  than 
those  directly  above  the  sensors.  This  is  only  possible  since  the  blurring  of  signals 
by  the  compliant  layer  results  in  information  about  most  of  the  stress  distribution 
at  the  surface  being  passed  to  every  sensor.  The  exceptions  to  this  are  due  to  zeros 
in  the  convolution  kernel. 
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2.3  Integral  Equations  of  the  First  Kind 


Equation  (2.2.1)  above  is  of  a  class  more  generally  referred  to  as  integral  equations 
of  the  first  kind.  The  general  form  of  integral  equations  of  the  first  kind  is  given 

by, 

g(t )  =  f  k(t,s)f(s )  ds  c  <  t  <  d.  (2.3.6) 

J  a 

Equation  (2.3.6)  may  be  rewritten  in  operator  notation  as, 

g(t)  =  ( Kf)(t )  (2.3.7) 

where  K  is  the  integral  operator  with  kernel  function  k.  In  the  particular  case  of 
equation  (2.2.1)  K  is  a  convolution  operator  with  the  convolution  kernel  k.  Since 
g(-)  and  /(•)  in  this  case  represent  strain  and  stress  respectively  and  are  therefore 
signals  with  finite  energy,  we  consider  the  case  where  g,f  E  T2([a,6])  with  norm 
defined  by, 

\\h\U  =  \h(x)\2  dxj  (2.3.8) 

and  a  =  ymin,b  =  ymax .  Thus  we  are  interested  in  the  case  when,  kx(y,y0)2£ 
X2([a,  b]  x  [a,  6])  and 

Px{y)  =  {Kqv)(y)  =  f  kx(y,y0)qv(y0)dy0.  (2.3.9) 

J  a 

2.4  Ill-Posedness  of  the  Inverse  Problem 

To  demonstrate  the  ill-posed  nature  of  the  inverse  problem,  we  would  like  to  do 
the  following, 

2We  will  use  kx(y,yo)  to  denote  l'*(t/,yo)  unless  otherwise  indicated 
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1.  Show  that  in  the  infinite  dimensional  setting  of  equation  (2.3.9)  the  inverse 
of  the  integral  operator  K  is  unbounded. 

2.  Verify  that  the  finite  matrix  representation  of  K  in  equation  (2.2.5)  (obtained 
by  discretization  of  the  kernel  function)  is  indeed  a  justifiable  approximation 
of  the  operator  K. 

3.  Show  that  the  finite  dimensional  manifestation  of  the  unboundedness  of  the 
inverse  of  K  occurs  in  the  ill-conditioning  of  the  finite  matrix  representation 
of  hr. 

In  order  to  clarify  latter  discussion  we  present  some  definitions,  notation,  and 
theorems.  Proofs  of  the  theorems  are  to  found  in  Gohberg  and  Goldberg  [13] 

and/or  Rudin[14], 

Notation: 


H,  Hi  i  =  1,2, . 

. .  Hilbert  Spaces 

L(X) 

Bounded  linear  operators  on  X 

L(X,  Y) 

Bounded  linear  operators  from  X  to  Y 

Sp{fl,<t>2,.  ■  ■} 

Span  of  {<f>i,<j>2,-- ■} 

Im(A) 

Image  of  A 

A* 

Adjoint  of  the  operator  A 

Definition  2.2  ,4n  operator  A  €  L(Hi,H2)  is  compact  if  for  each  sequence  {x„} 
in  Hi,  such  that  ||a:n||  =  1,  tke  sequence  { Axn }  has  a  subsequence  which  converges 
in  H2. 


Lemma  2.1  If  A  £  L(Hi,H2 )  is  of  finite  rank  then  A  is  compact,  m 
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Theorem  2.1  Let  {A„}  be  a  sequence  of  compact  operators  in  L(Hi,H2)  such  that 
||/ln  —  j4| |  — ►  0  as  n  oo,  where  A  £  L(Hi,H2).  Then  A  is  compact.  ■ 

To  show  that  the  integral  operator  K  is  bounded,  we  note  that  since  k  £ 
L2([a,b]  x  [a,  6]), 

f  I  \k(t,s)\2  ds  dt  <  oo.  (2.4.10) 

J  a  J  a 


Now, 


(Kf)(t)  =  /  k(t,s)f(s)  ds. 

J  a 


(2.4.11) 


So  by  Schwarz’s  inequality, 


jf‘  »)/(»)!  <fe  <  (f‘  |i(i,4)|2  is)'!\  f  |/MI2  dsf\  (2.4.12) 


Therefore, 


UKfU1  <  J‘(J‘\k(t,s)f(s)\  dsf  <  m2J'f\k(t,s)\‘  ds  dt.  (2.4.13) 


So, 


Hit'll2  <  f  f  \k(t,  s)|2  ds  dt  <  oo.  (2.4.14) 

J  a  Ja 

Hence  the  operator  I<  is  bounded.  I<  is  clearly  linear  as  well  and  so  K  6  L(H) 
where  H  —  Z/2([^*^])*  Since  from  equation  (2.2.2)  we  see  that  fc(«s,£),  the 

operator  K  is  also  self  adjoint. 

To  show  that  K  is  a  compact  operator  we  construct  a  sequence  of  operators  of 
finite  rank  and  then  use  Theorem  2.1. 

Let  <j>i ,  <f>2 ,  ■ . .  be  an  orthonormal  basis  for  L2([a,b]).  Then, 

$ij(t,s)  =  <t>i{t)<j>j(s)  i,j  =  1,2,... 
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is  an  orthonormal  basis  for  L2[a,  b }  x  [a,  6])  (see  Gohberg  and  Goldberg  [13]).  There¬ 


fore, 


Let 


Then, 


k(t,  s) 


CO 


=  X  <  >  *«(*,«)• 


(2.4.15) 


kn(t,s)  =  X  <  *i$ii  >  $«j(M)- 

i.J  =  l 


(2.4.16) 


ll*-*n||-0, 


(2.4.17) 


where  ||  •  ||  is  the  norm  on  L2([a,  b]  x  [a,  b]).  Let  I<n  be  the  integral  operator  defined 
on  L2([a,b\)  by, 

(■ Knf){t)—  /  kn(t,s)f(s)  ds.  (2.4.18) 

J  a 

So  Kn  is  bounded,  linear  and  of  finite  rank  since  Im(Kn)  C  Sp{<f> i , . . . ,  There¬ 
fore  by  Lemma  1,  Kn  is  also  compact.  Now, 


Ml  <(/*/*  ds  it)'11  =  ||l||.  (2.4.19) 

Ja  -J  a 


Thus  applying  (2.4.17)  and  (2.4.19)  to  K  —  Kn,  we  get, 


\\K-Kn\\  <  ||fc-  Ml  -»  0. 


(2.4.20) 


So  by  Theorem  2.1,  K  is  compact. 

To  summarize,  we  have  thus  far  shown  that  the  integral  operator  K  is  bounded, 
linear,  self  adjoint  and  compact. 

Although  every  linear  operator  on  a  finite  dimensional  Hilbert  space  over  (T 
has  an  eigenvalue,  it  is  not  true  that  even  a  self  adjoint  operator  on  an  infinite 
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dimensional  Hilbert  space  must  have  an  eigenvalue.  The  next  few  theorems  are 
concerned  with  the  eigenvalues  of  such  operators. 

Theorem  2.2  (a) Any  eigenvalue  of  a  self  adjoint  operator  is  real 

(b) If  A  is  an  eigenvalue  of  A  G  L{H )  then,  |A|  <  ||A||. 

(c) If  A  G  L(H)  is  compact  and  self  adjoint  then  A  has  an  eigenvalue  and  at 
least  one  of  the  numbers  ||A||  or  —  ||A||  is  an  eigenvalue  of  A.  ■ 

Theorem  2.3  Let  A  G  L(H )  be  a  compact  self  adjoint  operator,  where  H  is  an 
infinite  dimensional  Hilbert  space.  Then  the  spectrum  a  (A)  of  A  consists  of  zero 
and  the  eigenvalues  of  A.  fO  G  &{A))  ■ 

Now,  (A I  —  K )  is  invertible  for  \fca(K).  Since  K  is  compact  and  self  adjoint, 
by  Theorem  2.3,  0  G  cr(K).  Hence  the  inverse  of  (A I  —  K)  is  unbounded  for 
A  =  0,  which  is  equivalent  to  saying  that  the  inverse  of  the  integral  operator  K  is 
unbounded. 

From  the  above  result,  it  is  clear  that  in  the  infinite  dimensional  case  the  inverse 
problem  is  ill-posed  in  the  sense  of  Hadamard  since  the  inverse  of  the  operator  K 
is  unbounded,  solutions  will  not  depend  continuously  upon  the  initial  data. 

2.5  Finite  Matrix  Approximation  of  the  Convolution  Kernel 

In  equation  (2.2.5)  the  convolution  operator  K  has  been  approximated  by  a  finite 
rank  operator  by  discretizing  the  kernel  k  and  considering  only  samples  of  the 
stress  and  strain.  In  order  to  justify  the  use  of  such  an  approximation,  we  state 
the  following  theorem. 
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Theorem  2.4  Every  compact  operator  in  L{H\,H2)  is  the  limit,  in  norm  ,  of  a 
sequence  of  operators  of  finite  rank.  9 


In  fact,  in  Section  2.4  it  was  shown  that  K  is  the  limit  (in  norm)  of  the  sequence 
of  finite  rank  operators  {/f„}. 

If  {<j)j}  is  an  orthonormal  basis  for  L2([a,b])  we  know  that 

<MM)  =  -*\J  =  1,2,...  (2.5.21) 


forms  an  orthonormal  basis  for  L2([a,  b]  x  [a,  b}).  The  infinite  matrix  representation, 
[a,j],  of  the  integral  operator  K  with  respect  to  the  basis  is  given  by, 


a{j  =<  K<f>j,4>i  >=  f  f  k(t,s)(j)j(s)<j)i(t)  ds  dt  =<  k,$ji  >  .  (2.5.22) 

Ja  Ja 


Since  0  €  cr(K)  and  A  =  [a,y]  is  unitarily  equivalent  to  K,  zero  is  also  in  the 
spectrum  of  A  and  thus  the  inverse  of  A  is  also  not  a  bounded  operator.  Theorem 
2.4  says  that  the  operator  I<  can  be  approximated  arbitrarily  well  by  an  operator 
of  finite  rank,  say  Kn  (of  rank  n)  with  finite  matrix  representation  An.  As  n  ->  oo 
the  approximation  becomes  better,  however,  since  0  €  cr(K)  ,  as  n  — ►  oo  at  least 
one  of  the  eigenvalues  of  An  approaches  zero.  By  Theorem  2(b),  at  least  one  of  the 
eigenvalues  of  An  approaches  either  ||AT||  or  —  ||/fT||. 

The  condition  number  P(A )  of  a  matrix  A  is  defined  as 


P(A)  = 


max 

i  iMili 

min 


(2.5.23) 


j  |Ay(A)| 

In  solving  a  finite  system  of  linear  equations  of  the  form  Ax  —  y,  the  condition 
number  is  a  measure  of  sensitivity  of  solutions  to  errors  in  the  initial  data  y  and 
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approximations  made  in  the  inversion  of  the  matrix  A  (e.g.  finite  word  length 
effects)  on  the  solution  x.  Large  values  of  P(A)  result  in  large  errors  in  the  solution. 
In  the  event  that  P(A)  is  large  (P(A)  =  1  being  the  best  case),  we  say  that  the 
matrix  A  is  ‘ill-conditioned’. 

It  is  clear  from  the  observations  made  earlier  about  the  eigenvalues  of  An  that  as 
n  — *  oo,  An  becomes  increasingly  ill-conditioned  since  the  denominator  in  equation 
(2.5.23)  approaches  zero  as  the  numerator  approaches  ||/fj|. 

To  conclude  this  section,  we  summarize  the  results.  We  have  shown  that  in  the 
infinite  dimensional  case,  the  inverse  problem  is  ill-posed  in  the  sense  of  Hadamard. 
Ill-posedness  of  the  problem  in  this  case  is  due  to  the  unboundedness  of  the  inverse 
of  the  integral  operator  K  which  is  in  violation  of  Hadamard’s  third  requirement 
that  the  solution  depend  continuously  on  the  initial  data  for  the  problem  to  be  well 
posed.  We  have  justified  approximating  the  operator  K  by  a  finite  matrix  since 
any  compact  operator  in  L(Hi,  H^)  can  be  approached  (in  norm)  by  a  sequence  of 
operators  of  finite  rank.  Lastly,  we  have  made  the  observation  that  unboundedness 
of  the  inverse  of  K  induces  ill-conditioning  of  the  finite  matrix  approximation  to 
I<. 


2.6  Solving  the  Inverse  Problem 

It  was  shown  in  the  last  section  that  the  problem  of  solving  Kx  —  y,  where  K 
is  an  integral  operator  of  the  type  defined  in  equation  (2.3.9)  is  ill-posed.  There 
exists  a  large  body  of  literature  devoted  to  approximating  the  solutions  of  ill-posed 
problems  (see  e.g.  Tikhonov  [15]  and  Tikhonov  and  Arsenin  [16]).  One  successful 
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technique  for  solving  ill-posed  problem  is  regularization  which  was  introduced  by 
Tikhonov  [15]  in  1963.  Ill-posed  problems  such  as  the  one  considered  here  are  often 
insufficiently  constrained  and  require  the  imposition  of  additional  constraints  for 
the  solution  to  be  well  defined.  Regularization  is  a  technique  in  which  the  problem 
is  formulated  as  a  variational  principle  which  is  then  used  to  impose  physical  con¬ 
straints  on  the  solution.  A  variational  principle  defines  the  solution  to  a  problem 
as  the  function  which  minimizes  an  appropriate  cost  functional  (Poggio  and  Koch 

[3])-  . 

Regularization  requires  the  choice  of  a  norm  ||  •  ||  and  of  a  stabilizing  functional 
(typically  of  the  form  j|Pa:||).  The  stabilizing  functional  embodies  the  physical 
constraints  of  the  problem  and  thus  must  be  chosen  only  after  careful  analysis  of  the 
physical  setting  in  which  the  problem  arises.  Constraints  such  as  smoothness  and 
boundedness  of  solutions  may  be  imposed  by  appropriate  choice  of  the  stabilizer. 

The  problem  of  solving  Kx  =  y  can  be  formulated  as  a  variational  principle 
simply  by  choosing  a  norm  ||  •  ||  and  the  finding  x  which  minimizes 

il Kx  -  y 1 1 

To  regularize  the  problem  additional  constraints  are  imposed  through  the  stabiliz¬ 
ing  functional. 

Standard  regularization  theory  is  composed  primarily  of  three  methods  (see 
Poggio  and  Torre  (1984)). 

1.  Among  all  x  which  satisfies  the  condition  ||Px||  <  c,  where  c  is  a  constant, 
find  x  which  minimizes  \\Kx  —  y||. 
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2.  Among  all  x  which  satisfies  \\Kx  —  y||  <  e  , where  e  is  chosen  to  represent 
estimated  errors,  find  x  which  minimizes  ||Px||. 

3.  Find  x  which  minimizes 

\\Kx  —  j/||2  +  A[|Px(|2  (2.6.24) 

where  A  is  called  the  regularization  parameter. 

In  standard  regularization  theory,  the  operator  P  is  linear  and  the  norm  ||  •  || 
is  derived  from  an  inner  product.  For  such  quadratic  variational  principles,  of  the 
form  (2.6.24),  it  can  be  shown  that  under  mild  conditions  the  solution  space  is 
convex  (which  implies  the  existence,  uniqueness  and  stability  .of  solutions).  In  this 
thesis  we  will  consider  other  forms  of  the  stabilizing  functional  which  we  will  denote 
by  M(x).  Hence  we  will  be  considering  variational  principles  of  the  from 

\\Kx-  y||a  +  AM(x)  (2.6.25) 

The  regularization  parameter  A  controls  the  degree  to  which  a  solution  is  reg¬ 
ularized.  Small  values  of  A  compromise  the  degree  of  regularization  in  favor  of 
accurately  matching  the  initial  data.  Very  large  values  of  A  may  result  in  very 
regular  but  unrealistic  solutions, 

2.7  Regularizing  the  Tactile  Sensing  Problem 

To  regularize  the  inverse  problem  of  tactile  sensing  it  is  necessary  to  first  identify 
the  generic  physical  constraints  that  may  be  imposed  upon  the  solution.  In  the 
case  of  normal  stress  applied  to  the  the  compliant  pad  (see  Figure  2.3),  it  is  clear 
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that  the  unisense  nature  of  the  compressive  loading  on  the  boundary  can  be  cap¬ 
tured  by  constraining  solutions  to  lie  in  the  positive  orthant.  To  further  suppress 
some  of  the  deleterious  effects  of  sensor  noise,  the  solutions  may  be  constrained 
to  be  smooth.  Constraining  solutions  to  be  smooth  may  result  in  inaccurate  so¬ 
lutions  near  physical  edges,  however,  edges  may  be  recovered  in  a  second  stage  of 
regularization. 

The  constraints  of  nonnegativity  and  smoothness  of  the  solutions  can  be  em¬ 
bodied  in  the  stabilizing  functional  by  choosing 

M(x)  =  z.Tog  X;  (2.7.26) 

t 

which  has  the  same  functional  form  as  Shannon  entropy.  Hence  in  the  case  of  the 
inverse  problem  of  tactile  sensing,  the  problem  is  to  find  a  vector  /  6  HR.N  which 
minimizes 

l|T/„-ei||J  +  A2:/v,log/„,  (2.7.27) 

t 

where  T  £  WlNxN  is  a  finite  matrix  approximation  to  the  convolution  operator, 
e*  €  R*  is  the  vector  of  measured  strains  and  /„  £  1RW  is  the  vector  of  stress 
components.  (The  norm  ]  |  -  |  j  is  the  standard  Euclidean  norm  on 

Since  the  choice  of  M(x)  in  (2.7.26)  is  not  of  a  form  for  which  there  exists  a 
large  body  of  results  regarding  existence,  uniqueness  and  stability  of  solutions,  such 
properties  must  be  verified. 

2.7.1  Uniqueness  of  Solutions 

Uniqueness  of  solutions  to  (2.7.27)  is  demonstrated  in  the  following  lemma. 
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Lemma  2.2  Given  A  £  RnXn  symmetric ,  x  £  Etn  and  y  £  W1 , there  exists  at  most 
one  solution  to  the  problem 


min 

X  £  Rn  \\Ax  -  2/| |2  +  A  2jx,-logx,- 


(2.7.28) 


Proof 

Let 


M(x)  =  || Ax  -  y||2  +  A  xi  log  Xi 


Then, 


/,  \ 

logXi 


v  M(x)  =  2 At(Ax  -  y)  + 


+  1 


(2.7.29) 


y  logx„  j 

Where  1  =  (1, 1, ... ,  1)T. 

Assume  there  exists  two  solutions  x  and  x  to  (2.7.28).  Then  since  at  the  solution 
VAf(i)  =  0,  we  have 


V  M(x)  =  0  and  y  M{x)  =  0. 


(2.7.30) 


From  (2.7.29)  and  (2.7.30), 


2 AT(Ax  -y)  + 


^  logXi 


log  xn  J 


+  1  =  0 


(2.7.31) 


and 
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(  logfj  \ 


2At(Ax  -y)  + 


+  1=0 


log  xn  J 


Subtracting  (2.7.32)  from  (2.7.31), 


(2.7.32) 


( -  log  x  log  X\ 


\ 


2AtA(x  -  x)  =  - 


^  log  xn-xn  J 

Taking  the  inner  product  of  both  sides  of  (2.7.33)  with  (x  —  x), 


(2.7.33) 


log  X  —  log  X\ 


(x  —  x)  •  2At A(x  —  x)  =  —  (x  —  x)  ■ 


(2.7.34) 


y  log  xn  -  xn  j 

Since  AT A  is  positive  semidefinite  and  symmetric  and  the  log  function  is  mono- 
tonically  increasing,  the  left  hand  side  of  equation  ( 2.7.34 )  greater  than  zero  while 
the  right  hand  side  is  less  than  zero  which  is  a  contradiction. 

Hence  it  is  necessary  that  x  =  x  i.e.  there  is  at  most  one  solution.  ■ 
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CHAPTER 

THREE 


NONLINEAR  ELECTRICAL  NETWORKS 


Nonlinear  electrical  networks  have  been  a  topic  of  active  research  for  many 
years.  Hence  there  exists  a  large  body  of  results  pertaining  to  such  networks.  As 
elaborated  in  Chapter  1 ,  the  transition  from  neural  networks  to  electrical  networks 
is  not  only  trivial,  but  natural.  Mathematical  analysis  of  neural  networks  is  as  yet 
a  developing  field.  It  seems  natural  to  look  to  the  available  results  for  nonlinear 
electrical  networks  for  insight  and  understanding  of  the  behavior  of  neural  networks. 

In  this  chapter  an  attempt  is  made  to  demonstrate  one  particular  application 
of  electrical  network  analysis  to  neural  networks.  We  define  an  energy  function  for 
a  dynamical  system  as  a  functional  which  is  minimized  (globally  or  locally)  as  a 
result  of  the  natural  time  evolution  of  the  system.  In  Chapter  4  the  results  of  this 
chapter  are  applied  to  determine  an  energy  function  for  a  neural  network  designed 
to  solve  the  inverse  problem  described  in  Chapter  2. 

Some  of  the  earliest  work  on  electrical  networks  was  done  by  James  Clerk 
Maxwell  in  1873  [18]  and  is  concerned  with  the  distribution  of  currents  and  volt¬ 
ages  in  linear  resistive  networks.  Later  work  by  Tellegen  [19],  Cherry  (1951)  [20], 
and  Millar  (1951)  [21]  helped  to  build  a  foundation  for  nonlinear  network  analy¬ 
sis.  In  1964  Brayton  and  Moser  [22]  attempted  to  build  a  more  general  theory  of 
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nonlinear  networks  by  considering  some  geometric  aspects  of  such  networks.  More 
recently,  several  researchers  have  adopted  an  even  more  general  geometric  view  of 
nonlinear  networks  (see  e.g.  [22],  [23],  [24], [25],  and  [26].  In  these  latter  works, 
network  dynamics  are  viewed  as  flows  (differential  equations)  on  nontrivial  man¬ 
ifolds  (nonlinear  spaces).  It  is  clear  that  a  great  deal  of  powerful  machinery  has 
been  developed  for  analysis  of  nonlinear  networks.  Applications  of  the  same  tools 
and  body  of  results  to  neural  networks  should  prove  useful. 

3.1  Basics  of  RC  Electrical  Networks 

We  consider  a  network  composed  of  branches  and  nodes  with  the  restriction  that  a 
branch  connects  exactly  two  nodes.  Arbitrarily  assigning  a  direction  to  the  branch 
currents,  we  define  as  the  current  flowing  from  the  initial  node  to  the  end  node  of 
the  /^th  branch  in  the  network.  The  branch  voltage  is  defined  as  the  voltage  rise 
measured  from  the  end  node  to  the  initial  node  of  the  /zth  branch  in  the  network. 

For  any  network,  a  complete  set  of  generalized  current  or  voltage  coordinates  can 
be  chosen.  Such  a  set  of  variables  is  complete  in  the  sense  that  they  can  be  assigned 
values  independently  without  violation  of  KirchofPs  laws  and  that  they  determine 
in  each  branch  of  the  network  one  of  the  two  variables,  branch  current  or  branch 
voltage.  In  computing  m,  the  number  of  defining  current  coordinates,  constant 
current  sources  if  any  are  not  counted  as  branches  and  similarly  in  computing 
n,  the  number  of  defining  voltage  coordinates,  constant  voltage  sources  are  not 
counted  as  branches.  In  the  particular  case  of  a  RC  network,  a  complete  set  of 
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variables  is  obtained  by  considering  the  voltages  across  all  independent  capacitive 
branches.  Two  capacitors  in  parallel  are  considered  as  a  single  capacitive  branch 
with  capacitance  equal  to  the  parallel  combination  of  the  two.  We  will  denote  the 
complete  set  of  variables  for  a  RC  network  by  v*  =  (v±, . . . ,  vn). 

In  the  theory  of  electrical  circuits  there  are  two  fundamental  laws,  namely 
KirchofPs  laws  which  form  the  basis  for  analysis  of  any  electrical  circuit.  Kirchoff’s 
current  or  node  law  states  that  the  sum  of  currents  at  any  node  in  a  network  must 
be  zero,  i.e., 

£ *'  =  o. 

node 

Kirchoff’s  voltage  or  loop  lav/  states  that  the  sum  of  voltages  around  any  loop  in 
the  network  must  be  zero,  i.e. 

=  °- 

loop 

Among  the  first  theorems  for  linear  electrical  networks  is  Maxwell’s  Minimum  Heat 
Theorem  (1837)  which  is  stated  below  in  modern  terminology. 

Theorem  3.1  (Maxwell’s  Minimum  Heat  Theorem)  For  any  linear,  time- 
invariant,  resistive  network  driven  by  voltage  and/or  current  sources,  of  all  the 
current  distributions  consistent  with  Kirchoff’s  current  law,  the  only  distribution 
which  is  also  consistent  with  Kirchoff’s  voltage  law  and  therefore  the  true  distribu¬ 
tion  is  the  one  which  minimizes  the  quantity  (W  —  2PV)  where  W  is  the  total  power 
dissipated  by  the  resistors  and  Pv  is  the  total  power  supplied  by  the  voltage  sources. 


Having  chosen  a  set  of  generalized  current  coordinates  say  for  example  z'i, . . . ,  im, 
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Maxwell’s  theorem  leads  to  the  following  set  of  m  equations, 

— (W-2PV)  =  0  j  —  1 , . . . ,  m  (3.1.1) 

which  when  solved  for  the  current  coordinates  ii, . . . ,  im  determine  all  variables  in 
the  network. 

The  dual  of  Maxwell’s  theorem  states  that  for  the  above  network,  among  all 
voltage  distributions  which  are  consistent  with  Kirchoff’s  voltage  law,  the  only 
one  which  is  also  consistent  with  Kirchoff’s  current  law  and  therefore  the  true 
distribution  is  the  one  which  minimizes  {W  —  2 Pi),  where  Pj  is  the  total  power 
supplied  by  the  current  sources.  This  dual  theorem  can  be  used  to  solve  for  all 
variable  in  the  network  by  first  solving  for  the  generalized  voltage  coordinates. 

Maxwell’s  theorem  tells  us  that  the  quantities  ( W  —  2 Pv)  and  (W  —  2 Pi)  are 
stationary  with  respect  to  the  distributions  of  currents  and  voltages  in  the  network 
respectively.  It  can  be  shown  that  these  quantities  are  not  in  general  stationary  in 
the  case  of  networks  containing  nonlinear  elements.  Hence  to  solve  for  voltages  and 
currents  in  a  nonlinear  network  in  an  analogous  manner  we  must  determine  first 
the  stationary  quantities.  Millar  [20]  determined  such  quantities  and  we  present 
first  some  definitions  relating  to  nonlinear  networks  before  we  state  his  results. 


Definitions  Pertaining  To  Nonlinear  Networks 
Elements:  Two  terminal  elements  can  in  general  be  described  by  a  relationship 
of  the  form, 

di  (Pi  dv  (Pv 

=  0  (3'L2) 

Where  v  is  the  voltage  across  the  two  terminals  of  the  element  and  i  is  the  current 
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through  the  element. 

Nonreactive  elements  are  those  for  which  the  dependence  on  time  derivatives  in 
equation  (3.1.2)  is  absent,  i.e.  they  can  be  described  by  f(i,v,t)  =  0. 

Time  invariant  elements  are  those  for  which  the  defining  function  is  not  explic¬ 
itly  time  dependent 

For  time  invariant  nonreactive  elements  the  locus  of  f(i,v)  =  0  is  called  the 
characteristic  curve  of  the  element.  For  general  time  invariant  elements,  the  i- 
v  curve  represents  a  trajectory  in  the  phase  space  of  the  element  viewed  as  a 
dynamical  system. 

A  time  invariant  nonreactive  element  is  said  to  be  passive  if  the  characteristic 
curve  intersects  the  i-v  axes  at  no  point  other  than  the  origin.  Otherwise  the 
element  is  said  to  be  active. 

Sources :  A  current /voltage  source  is  a  time  invariant  active  nonreactive  element 
for  which  voltage/current  is  absent  from  the  defining  function 

3.2  Invariants  of  Motion  in  Nonlinear  Electrical  Networks 

Two  quantities  defined  by  Millar  [20]  relating  to  elements  of  a  nonlinear  network 
are  the  ‘content’  and  the  ‘co-content’  of  an  element. 

3.2.1  Content  and  Co-content 

Let  (fi,Ui)  be  a  point  on  the  i  —  v  curve  of  a  two  terminal  element.  The  ‘content’ 
of  the  element,  denoted  by  G  is  defined  by, 

G  =  f  v  di.  (3.2.3) 

Jo 
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The  ‘co-content’  of  the  element  is  denoted  by  J  and  is  defined  by, 


-fi 


dv 


(3.2.4) 


We  observe  that  for  a  passive  element  the  total  power  dissipation  is  W  =  J  +  G 
and  that  for  a  linear,  passive  element  J  =  G  =  W/2 

The  total  content  (co-content)  of  a  network  is  defined  as  the  sum  of  contents 
(co-contents)  of  all  constituent  elements  including  current  and  voltage  sources.  We 
will  use  G  and  J  to  denote  the  total  content  and  co-content  respectively.  We  will 
use  Gjk  and  Jjk  to  denote  the  content  and  co-content  of  the  element  connecting 
nodes  j  and  k. 


3.2.2  Stationary  Quantities 

The  following  theorem  identifies  G  and  J  as  stationary  quantities. 

Theorem  3.2  (Millar)  If  in  an  active  (possibly  reactive)  network,  the  sum  J  of 
the  co-contents  of  all  the  constituent  elements  is  expressed  in  terms  of  the  defining 
number  of  voltage  coordinates  of  the  network  subject  only  to  the  restrictions  of 
Kirchoff’s  voltage  law,  then  J  is  stationary  for  the  actual  distribution  of  voltages. 


The  dual  of  this  theorem  is  obtained  by  replacing  co-content  by  content,  J  by 
G,  and  voltage  by  current  everywhere. 

Theorem  3.2  provides  us  with  the  equivalent  of  Maxwell’s  theorem  for  nonlinear 
networks  of  time  invariant  elements.  To  restate  the  above  theorem  in  a  manner 
analogous  to  Maxwell’s  theorem,  we  can  say; 
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In  any  active  (possibly  reactive )  network,  of  all  distributions  of  current/voltage 
consistent  with  Kirchoff’s  current/voltage  law,  the  ones  for  which  G/J  is  stationary 
are  the  only  ones  that  are  also  consistent  with  Kirchoff’s  voltage/current  law  and 
are  thereby  the  true  distributions. 

Thus  if  we  express  G/  J  in  terms  of  the  defining  number  ( m/n )  of  current/ voltage 
coordinates,  we  can  determine  all  currents  and  voltages  in  the  network  by  solving 
either  one  of  the  following  sets  of  simultaneous  partial  differential  equations, 

dG  n  , 

—  =  0  r  =  l,...,m  ,  (3.2.5) 

or 

dJ 

^-  =  0  q  —  1, . . . ,  n  .  (3.2.6) 

3.2.3  Invariants  of  Motion 


The  next  theorem  identifies  G  and  J  as  invariants  of  motion.  That  is,  as  a  net¬ 
work  evolves  in  time  following  an  impulsive  change  in  one  or  more  of  the  current 
or  voltage  sources,  the  total  content  and  the  total  co-content  of  the  network  are 
conserved. 


Theorem  3.3  (Millar)  In  any  network  of  time  invariant  elements  (possibly  in¬ 
cluding  sources),  the  total  content  G  and  the  total  co-content  J  are  invariants  of 
motion,  i.e. 


and 


(3.2.7) 
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Where  ^  and  ^  are  the  total  time  derivatives  of  G  and  J  given  by, 

dG_dG  dG_dn  dG  dim 

dt  dt  di\  dt  ^  dim  dt 

and 

dj  dJ  dJ  dv  i  dJ  dvn  . 

Tt~  dt+d7,ir  +  '"  +  d7„ It  (3'2'8) 

where  m  is  the  defining  number  of  generalized  current  coordinates  and  n  is  the 
defining  number  of  generalized  voltage  coordinates.  ■ 

3.3  Energy  Functions  for  Nonlinear  RC  Electrical  Networks 

To  derive  an  energy  function  for  nonlinear  networks  of  the  type  described  we  first 
restate  and  prove  two  theorems.  The  first  of  these  theorems  is  known  as  Tellegen’s 
Theorem  [18],  but  is  presented  as  in  [21]  so  as  to  have  a  geometrical  interpretation. 
The  second  theorem  is  stated  by  Brayton  and  Moser  in  [21]. 

In  a  directed  network  with  b  branches  and  m  nodes,  the  set  of  branch  currents 
i  =  aQd  the  set  of  branch  voltages  v  =  (wi, ...,  u&)  are  vectors  in  a  b- 

dimensional  Euclidean  vector  space  £b  with  the  inner  product  defined  by  <  x,y  >= 
y;>=1  2^2/m.  Let  I  be  the  set  of  all  vectors  in  £b  such  that  if  i  £  X  then  the  constraint 
of  Kirchoff’s  current  law  is  satisfied  for  each  node  in  the  network,  i.e.  Ylnode  V  =  0. 
Similarly,  let  V  be  the  set  of  all  vectors  in  £b  such  that  if  v  6  V  then  Kirchoff’s 
voltage  law  is  satisfied,  i.e  Y2ioop  =  0.  1  and  V  are  clearly  subspaces  of  £b  since 
they  are  defined  via  linear  relationships  (Kirchoff’s  Laws).  The  following  theorem 
shows  that  X  and  V  are  orthogonal  subspaces  of  £b . 
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Theorem  3.4  (Tellegen’s  Theorem)  IfiEl  and  v  6  V  then 


<*>>=  0.  (3.3.9) 

Proof: 

Let  (Vi,...,Vn)  be  the  set  of  node  voltages  such  that  v  is  the  vector  of  voltage 
differences  between  the  end  nodes  and  the  initial  nodes  of  each  branch,  let  i^i  be 
the  current  from  node  k  to  node  l.  Then  if  the  branch  connecting  node  k  to  node  l 
is  labeled  as  the  gth  branch, 


=  (V*  —  V{)iki  =  (Vi  —  Vk)iik 


Due  to  the  symmetry  in  k  and  l, 


1 


J2  Vm  = 


n=l 


k  1 


but, 


and 


Vk  ikl  ~  iL  V‘  H ***) 

Z  k  l  Ik 


]C  ik‘  - 

l  node k 


02  ik> =  ]C  ^  v 

k  node l 


(3.3.10) 


Since  the  branch  currents  in  the  neiwork  must  satisfy  Kirchoff’s  current  law  we 
know  Ylnode  =  0.  Therefore  =  0  ■ 


Theorem  3.5  (Brayton— Moser)  Let  F  denote  a  one  dimensional  curve  inlxV 
with  coordinates  denoted  by  i  and  v.  Then, 


Proof:  Since  (dii, . . . ,  dif,)  G  T, 


<  v,di  >=  ^2  Vndif,  =  0  '  (By  Tellegen’s  theorem). 
m=i 

Integrating  the  above  sum  along  T  we  get, 


Integrating  by  parts  we  see  that, 


<  v,  t  > 


IP 


M dv n  —  0. 


By  Tellegen’s  theorem  we  know  <  v,i  >=  0,  therefore, 

f  b 

/  Y]  iudvn  =  0. 

Jr£i 


(3.3.12) 


(3.3.13) 


Having  stated  the  above  two  theorems  and  having  defined  the  complete  set  of 
variables  u*  =  (vi, . . .  ,  u/v)  for  the  network,  we  proceed  with  the  derivation  of  an 
energy  function. 

From  Theorem  3.5  we  know  that  jr  £*=1  v^di^  =  0.  We  choose  F  from  a  fixed 
initial  point  to  a  variable  end  point  in  £b  such  that  along  T  the  characteristic 
relationships  of  the  constituent  elements  of  the  network  are  satisfied.  We  can  write 
equation  (  3.3.13)  in  the  following  form: 


The  first  integral  is  over  all  capacitive  branches  and  the  second  is  over  all  other 
branches.  Integrating  the  first  integral  by  parts  we  get, 


(3.3.15) 


where  , 


P  = 


.6  .  N  - 

L  H  ®  a  +  Y2  ipv 

1  /i=AT+l  p= 1 


(3.3.16) 


Therefore  from  (  3.3.15)  we  also  have, 


,  w 

P  =  -  ipdvp. 

Jr  p= i 


(3.3.17) 


Let  tpdup.  We  now  establish  the  conditions  for  f  to  be  integrable  i.e 

the  conditions  for  the  line  integral  (  3.3.17)  defining  P  to  be  independent  of  the 
path  of  integration  T. 

For  (  3.3.17)  to  be  independent  of  T,  it  is  necessary  that  £  be  a  perfect  differ¬ 
ential.  That  is,  we  can  write  (  as,  £  =  da,  where  a  =  a(vi, . . . ,  v u)  and, 

.da  ,  da 

da  =  -z—dvi  + - 1-  - — dvN.  (3.3.18) 

dv  i  OvN  K  ’ 


But  since  £  =  da  we  have, 


da  —  i\dv  i  +  •  •  •  +  ijjdvN. 


(3.3.19) 


Comparing  (  3.3.18)  and  (  3.3.19)  we  get, 


da 

lp  =  dZ 


p  - 


Therefore 

dip  _  d2a  div 

dvv  dvpdvv  dvp  ’  ’’ 


(3.3.20) 


(3.3.21) 
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From  the  above  equation  (  3.3.21)  we  see  that  in  order  for  (  3.3.17)  to  be  indepen¬ 
dent  of  F,  the  following  must  hold: 

d^^lhTk  hk  =  l,...,N  j^k.  (3.3.22) 

If  (  3.3.22)  holds,  then  P  is  a  function  of  the  endpoints  of  T  alone. 

Assuming  that  the  line  integral  (  3.3.17)  is  independent  of  path  we  can  write 
for  the  currents  in  the  capacitive  branches, 


dP 

dva 


(3.3.23) 


However,  we  know  from  the  the  dynamical  law  of  capacitors  that, 


:  _  n 
lp  ~  Cp  dt 


P  =  i,--.,iv. 


(3.3.24) 


Therefore, 


r  ^p. 

dt 


dP(v*) 

dv0 


(3.3.25) 


where  v*  =  (tq, . . . ,  v n).  We  now  write  the  system  of  differential  equations  defining 
the  dynamical  behavior  of  the  network  in  the  following  vector  form, 


-Cx 


dP{x ) 
dx 


(3.3.26) 


where  x  —  v*,  C  —  diag(C\,  ...,Cjv),  and  dPQ^  is  the  gradient  of  P(x).  The 
following  theorem  relates  P{x)  to  the  equilibrium  states  of  the  network. 


Theorem  3.6  Let 

—  =  xeU  H  :U  R  (3.3.27) 

dt  ox 
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where  U  is  a  normed  vector  space  and  B  is  a  symmetric  positive  definite,  n  x  n 
matrix  function  of  x.  Then  ^ H(x(t ))  <  0  for  all  x  £  U  and  ^ H{x{t ))  —  0  if  and 
only  if  x  is  an  equilibrium  of  the  gradient  system  (  3.3.27). 

Proof: 

Taking  the  time  derivative  of  H  along  trajectories, 

dH(x)  . dH(x )  dx 

dt  dx  ’  dt 

f0H{x)  ndH(x )x 

=  {-~dT'-B~dr) 

_  nmM(x)  2 

B  dx 

Since  B  is  positive  definite  B 1^2  is  nonsingular  and  this  completes  the  proof.  ■ 

Corollary.  If  x  is  an  isolated  minimum  of  H(x)  then  x  is  an  asymptotically 
stable  equilibrium  of  the  gradient  system  (  3.3.27). 

As  the  matrix  C  is  positive  definite,  it  is  obvious  that  the  function  P{x)  which 
we  shall  call  the  energy  function,  decreases  along  solutions  to  (  3.3.26)  except  at 
equilibrium  points.  Therefore  the  equilibrium  states  of  the  network  correspond  to 
stationary  points  of  P(x)  and  the  local  minima  of  P(x)  are  the  stable  equilibria.  If 
in  addition  to  this  P(x)  -+  oo  as  ||i|j  — ►  oo  then  it  can  be  shown  that  all  solutions 
to  (  3.3.26)  approach  one  of  the  set  of  equilibrium  solutions  as  t  — ►  oo. 

We  observe  that  P(v*)  is  just  the  negative  of  the  co-content  of  all  independent 
capacitive  branches  in  the  network.  The  total  co-content  of  the  network,  J,  is  an 
invariant  of  motion  (Millar  [20])  even  for  dissipative  systems  where  the  total  energy 
is  not  conserved.  The  analogy  to  kinetic  and  potential  energy  of  a  nondissipative 
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CHAPTER 

FOUR 


NEURAL  NETWORKS  FOR  SOLVING 
THE  INVERSE  PROBLEM 


In  Chapter  1  it  was  noted  that,  in  all  but  the  most  rudimentary  applications  of 
tactile  sensing,  real-time  processing  of  tactile  sensory  data  is  crucial.  In  Chapter 
2  it  was  shown  that  the  inverse  problem  of  tactile  sensing  may  be  formulated  as 
a  variational  principle.  Furthermore,  the  inclusion  of  a  regularizing  penalty  func¬ 
tional  in  the  variational  principle  provides  immunity  to  sensor  noise  and  transforms 
the  problem  to  one  that  is  well-posed.  Solutions  to  the  inverse  problem  (obtained 
as  solutions  to  a  variational  principle,  require  the  minimization  of  the  nonlinear 
cost  functional  in  equation  (2.7.27).  Real-time  solutions  to  such  a  minimization 
problem  would  require  fairly  powerful  digital  hardware  using  standard  iterative  al¬ 
gorithms.  Since  it  is  our  objective  to  obtain  fast  solutions  to  the  inverse  problem 
as  well  as  limit  the  hardware  complexity  to  that  which  can  be  located  in  close 
proximity  to  individual  sensor  arrays,  analog  neural  networks  provide  an  attractive 
alternative. 

In  Section  4.1  an  analog  neural  network  is  described  which  has  been  structured 
so  as  to  solve  the  regularized  inverse  problem  of  tactile  sensing.  The  network 
described  also  exploits  any  inherent  parallelism  in  the  problem  since  solutions  are 
obtained  as  the  result  of  asynchronous  relaxation  of  all  variables  in  the  network. 
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In  Section  4.2  it  is  shown  that  the  network  described  in  Section  4.1  satisfies  the 
hypotheses  of  Theorem  3.7  and  thus  an  energy  function  for  the  network  is  explicitly 
given  by  the  theorem.  It  is  shown  that  the  resultant  energy  function  is  indeed  the 
variational  principle  of  equation  (2.7.27). 

Computer  simulations  of  the  network  are  used  in  Section  4.3  to  evaluate  the 
effect  of  the  regularizing  parameter  A  on  the  resultant  solutions  and  to  evaluate 
performance  of  the  network  in  the  presence  of  sensor  noise. 

4.1  The  Maximum  Entropy  Deconvolution  Network 

Inspired  by  the  work  of  Tank  and  Hopfield  (see  [27])  on  neural  networks  for  solving 
optimization  problems,  Marrian  and  Peckerar  [28]  suggested  a  neural  network  for 
solving  deconvolution  problems  with  an  entropy-like  regularizer.  Figure  4.1  shows  a 
schematic  of  a  two  channel  deconvolution  network.  The  network  proposed  consists 
of  two  planes  of  amplifiers  (i)  the  signal  plane  and  (ii)  the  constraint  plane  (see 
Figure  4.1).  Inputs  to  the  constraint  plane  (e  =  (el5 . . . ,  £at)t)  are  currents  pro¬ 
portional  to  sample  strain  measurements  obtained  from  an  array  of  tactile  sensors. 
Outputs  of  the  signal  plane  (v  =  (uj, . . . ,  i’jv)T)  are  voltages  which,  in  equilibrium, 
represent  regularized  solutions  to  the  inverse  problem.  The  interconnections 
are  conductances  (resistors  with  values  1/7Y,)  corresponding  to  elements  of  the 
matrix  representation  T  of  the  discretized  convolution  kernel.  Amplifiers  in  the 
signal  plane  axe  exponential  i.e.  g(x)  =  exp(x)  and  constraint  plane  amplifiers  are 
linear  with  gain  s  (f(x)  =  sx).  To  intuitively  understand  the  manner  in  which 
this  deconvolution  network  operates,  dynamical  evolution  of  the  network  can  be 
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Signal  Plane 


Constraint  Plane 


Figure  4.1:  N-Channel  Maximum  Entropy  Deconvolution  Network. 
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viewed  as  occuring  in  a  series  of  infinitesimal  discrete  time  steps.  If  evolution  of  the 
network  is  viewed  in  this  manner,  the  feedback  loops  generate  a  number  of  ‘analog 
iterations’.  Each  analog  iteration  is  approximately  composed  of  the  following  steps: 

1.  Outputs  of  the  signal  plane  are  convolved  with  the  discretized  kernel  T  in 
the  constraint  plane,  forming  the  vector  T  ■  v. 

2.  Error  in  the  current  estimate  of  the  solution  (given  by  outputs  of  the  signal 
plane)  is  evaluated  in  the  constraint  plane  by  subtracting  the  input  strain 
vector  e  from  the  results  of  the  previous  step  i.e  the  vector  T  ■  v  —  e  is  formed 
and  fed  back  to  the  signal  plane  through  the  constraint  plane  amplifiers  with 
gain  s. 

3.  Based  upon  feedback  from  the  constraint  plane,  the  outputs  of  the  signal 
plane  are  updated  so  as  to  reduce  the  error. 

4.  If  outputs  of  the  signal  plane  have  not  yet  settled  repeat  (l)-(3) 

It  remains  to  be  shown  that  the  above  series  of  ‘iterations’  do  indeed  converge, 
i.e.  that  the  network,  as  an  analog  electrical  circuit,  is  stable.  In  the  next  section 
the  results  of  Chapter  3  are  used  to  demonstrate  the  stability  of  the  network  and 
to  show  that  the  outputs  of  the  signal  plane  converge  to  regularized  solutions  to 
the  inverse  problem.  Since  (as  will  be  shown  in  the  next  section)  minimization 
of  equation  (2.7.27  also  attempts  to  maximize  the  entropy  of  the  solution,  we  will 
refer  to  the  network  in  Figure  4.1  as  the  maximum  entropy  deconvolution  network 
(or  MaxEnt  network  for  short). 
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4.2  Energy  Function  for  Maximum  Entropy  Deconvolution  Network 


Stability  of  the  MaxEnt  deconvolution  network  can  be  established  by  applying 
Theorem  3.7  which  also  determines  explicitly  an  energy  function  for  the  network.  In 
the  following  it  is  assumed  that  for  the  MaxEnt  network,  any  dynamics  associated 
with  the  Constraint  plane  amplifiers  are  negligible.  This  assumption  is  reasonable 
since  the  feedback  capacitor  associated  with  the  signal  plane  amplifiers  can  be 
chosen  so  that  the  response  of  the  signal  plane  amplifiers  is  sufficiently  slower 
than  those  of  the  constraint  plane.  In  order  to  apply  Theorem  3.7  it  is  necessary 
to  first  verify  that  the  network  satisfies  the  two  hypotheses.  From  Figure  4.1 
it  is  clear  that  the  voltages  across  the  signal  plane  capacitors  ui,...,un  form  a 
complete  set  of  variables  for  the  network,  i.e  Ui,...,«jv  can  be  assigned  values 
arbitrarily  and  that  they  determine  in  every  branch  of  the  network  one  of  the  two 
variables,  branch  current  or  node  voltage.  It  is  intuitively  obvious  that  a  ‘potential’ 
function  should  depend  only  upon  the  current  state  of  the  network  and  the  fixed 
‘zero’  reference  potential  chosen.  To  establish  path  independence  of  the  integral  in 
equation  (3.3.17)  (i.e.  to  establish  that  (3.3.17)  depends  only  upon  the  endpoints 
of  T)  the  ‘mixed-partials’  condition  of  (3.3.22)  is  used. 

From  Figure  4.1,  the  current  through  the  capacitor  connected  to  the  nth  signal 
plane  node  is  given  by, 

=  cl£  =  ~^tk^Tk ' u  “ efc)-  t4-2-1) 

Here  Tk  =  (fjti,  ffc2,  •  ■  • ,  ijwv)T*  Since  un  =  p-1(un),  the  current  in  can  also  be 
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written  as,  x, 


•  _  /~i  dun  __  9  (^n)  1  ,  r/rp  \ 

ln  —  6/  ,,  —  n  r-,  tfcn/(Tfc  •  U  £fc). 


dt 


R  R 


(4.2.2) 


The  energy  function  P  is  thus  a  function  of  v  =  (w1? _ ,v^)T  i.e.  v  can  be  used  as 

an  alternate  set  of  independent  variables  and 


P(v)  =  -  J  Yj^dvn. 


From  (  4.2.2)  it  is  seen  that, 


and, 


dvj  —  dv, '  (  Hktknf(Tk'V  efc)) 

=  —  Efc  tkntkjf'{Tk‘V  —  ek) 

k,j,n  =  l,...,iV,  j  ~fi  n 


^  ~  ^-(-E  khjfin-v-tk)) 


(4.2.3) 


(4.2.4) 


dv„  dvn 

=  -Zktkjtknf,(Tk.v-ek)  (4-2.5) 

=  |j£  k,j,n  =  l,...,iV,  j  ±  n. 

Thus  the  mixed  partials  conditions  of  (3.3.22)  are  satisfied. 

The  dynamical  equations  of  the  network  can  now  be  written  in  the  following 

form: 

< In  -  r)  P( id 

(4.2.6) 


du  __  LdP(v(u)) 
dt  d  v(u) 


where  v  =  (tq  . . .  ,Ujv)  and  vn  =  g  1(un).  Since  V{  =  g{u{), 


dv  _  rr-i  dP(v) 
dt  G  dv  ’ 


(4.2.7) 


Tt  has  been  assumed  here  that  the  amplifier  function  g(-)  is  bijective  which  is  clearly  true  for 
g(x)  =  exp(ar). 
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where  G  —  diag(g'(u\), . . .  ,j'(iin)).  Because  g(u)  =  exp(u),  is  a  monotone  increas¬ 
ing  function  G  is  always  positive  definite. 

Applying  Theorem  3.6  to  (  4.2.7),  it  is  clear  that  the  equilibrium  states  of  the 
network  must  correspond  to  stationary  points  of  P(-). 

In  order  to  understand  the  nature  of  the  equilibrium  states  (if  any  exist)  of  the 
network  we  must  evaluate  the  integral  expression  for  P. 


pm  =  /r  £  (- 1  +  £  w(n  •  v  -  «)) 


(4.2.8) 


Choosing  0  as  the  starting  point  of  the  path  T,  we  can  write  (  4.2.8)  as, 

fvn  Sf“1(yn)  ;  ^  fVn  dvn 


R 


+  f  Y,Yltknf(Tk  ■  U  ~  ek)dvn. 
JV  n  k 

If  we  let  F(zk)  be  such  that  dF(zk)/dzk  =  f(zk)  then, 


(4.2.9) 


(4.2.10) 


For  the  deconvolution  network  shown  in  Figure  4.1  the  signal  plane  amplifiers  are 
characterized  by  g(u)  —  exp(u).  Therefore 


f 


Vn  g 

R 


dvn  —  vn  log  vn 


(4.2.11) 


From  Figure  4.1  we  also  see  that  the  characteristics  of  the  constraint  plane  ampli¬ 
fiers  are  given  by  f(z)  —  sz  where  s  is  a  constant  defining  the  feedback  gain.  We 
can  thus  define  the  function  F(Tk  •  v  —  e*)  by, 


F(Tk-v~ek)=S-(Tk-v-ek)2.  (4.2.12) 
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where  G  =  diag(g'(u1), . . . ,  g'(uN)).  Because  g(u )  =  exp(u),  is  a  monotone  increas¬ 
ing  function  G  is  always  positive  definite. 

Applying  Theorem  3.6  to  (  4.2.7),  it  is  clear  that  the  equilibrium  states  of  the 
network  must  correspond  to  stationary  points  of  P(-). 

In  order  to  understand  the  nature  of  the  equilibrium  states  (if  any  exist)  of  the 
network  we  must  evaluate  the  integral  expression  for  P. 


p(v )  =  frJ2  ^  tknf(Tk  ■  v  -  ek)j  dvn 

Choosing  0  as  the  starting  point  of  the  path  T,  we  can  write  (  4.2.8)  as, 

fVn  5_1(un) 


(4.2.8) 


u"  dvn 
~R 


+  /  YLYltknf{rPk-v-ek)dvn. 

Jr  n  k 


(4.2.9) 


If  we  let  F(zk )  be  such  that  dF(zk)/dzk  =  f(zk)  then, 


P(v)  =  E  r  €irldv’>  +  Ei+Enn-v- «*).  (4.2.10) 

For  the  deconvolution  network  shown  in  Figure  4.1  the  signal  plane  amplifiers  are 
characterized  by  g(u)  =  exp(u).  Therefore 


L 


V"  3  l(Vn) 
R 


dvn  -  vn  log  vn  -  vn. 


(4.2.11) 


From  Figure  4.1  we  also  see  that  the  characteristics  of  the  constraint  plane  ampli¬ 
fiers  are  given  by  f(z)  =  sz  where  s  is  a  constant  defining  the  feedback  gain.  We 
can  thus  define  the  function  F(Tk  •  v  —  e*)  by, 


F(Tk-v-ek)  =  -(Tk-v-ek)2. 


(4.2.12) 
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From  (  4.2.10)  and  (  4.2.11) 


p(v)  =  Y  ‘  v  ~  e*)2  +  ^YVnloSvn-  (4.2.13) 

k  “  “it 

It  is  clear  from  (  4.2.13)  that  P(u)  — ►  oo  as  ||v||  — >  oo.  Hence  all  solutions  to 
(  4.2.7)  approach  one  of  the  set  of  equilibrium  states  as  t  — *  oo.  In  Chapter  2  it 
was  shown  that  there  exists  a  unique  minimum  of  (  4.2.13)  which  correspond  to 
the  regularized  solution  of  the  inverse  problem.  Therefore  outputs  of  the  network 
will  converge  to  a  regularized  solution  of  the  inverse  problem. 

Equation  (  4.2.13)  gives  us  an  explicit  form  for  the  energy  function  for  the 
maximum  entropy  deconvolution  network  which  has  been  derived  using  nothing 
more  than  Kirchoff’s  laws. 

4.2.1  Introduction  of  other  Regularizers 

As  discussed  in  Chapter  2,  the  choice  of  a  regularizing  principle  must  be  based  upon 
the  physical  constraints  present  in  the  problem.  Also  in  Chapter  2  it  was  shown 
that  the  entropy  regularizer  is  appropriate  for  the  recovery  of  a  stress  distribution 
which  is  normal  to  the  compliant  sensing  pad.  In  the  case  of  tangential  stress 
distributions,  use  of  an  entropy  regularizer  would  be  inappropriate  since  tangential 
stress  is  not  unisense  in  nature.  However,  it  can  be  seen  from  equation  (4.2.10) 
that  the  network  structure  described  in  the  last  section  is  not  restricted  to  use  with 
an  entropy  regularzer  only.  It  is  clear  that  any  regularizer  which  can  be  written 
in  the  form: 
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can  be  introduced  into  the  energy  function  of  the  network  provided  g(-)  is  a  mono¬ 
tone  increasing2  function  which  can  be  implemented  as  the  characteristic  function 
of  an  analog  amplifier. 

It  has  been  demonstrated  in  this  chapter  that  analog  neural  networks  can  be 
structured  so  as  to  determine  solutions  to  problems  formulated  as  variational  prin¬ 
ciples.  Although  the  network  discussed  here  is  specifically  structured  to  solve  the 
inverse  problem  of  tactile  sensing,  the  results  of  Chapter  3  are  more  general  and 
can  be  applied  to  a  large  class  of  problems. 

4.3  Simulations 

Computer  simulations  of  the  maximum  entropy  deconvolution  network  were  per¬ 
formed  in  order  to  better  understand  behavior  of  the  network  in  terms  of  speed 
of  convergence,  noise  immunity  and  the  effect  of  the  regularizing  parameter  A. 
SIMNON3,  an  interactive  simulation  program  for  nonlinear  dynamical  systems  was 
used  to  simulate  the  network. 

For  the  purpose  of  simulation,  the  stress  distribution  at  the  surface  of  the 

compliant  layer  was  assumed  to  be  the  result  of  applying  pressure  normal  to  the 

surface  using  a  cylindrical  object  (see  Figure  4.2).  The  resulting  normal  stress 
2The  monotonicity  of  the  function  g(-)  is  necessary  to  satisfy  the  requirement  that  the  matrix 
G  in  equation  (4.2.7)  is  positive  definite. 

3SIMNON  was  provided  to  us  by  Professor  Astrom  from  the  Lund  Institute  of  Technology. 
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train  Sensors 


Elastic  Half  Space 


X 

Figure  4.2:  Application  of  stress  at  boundary  of  elastic  half-space  using  a  cylindrical 
object 

distribution  due  to  such  a  cylindrical  indentor  can  be  written  as  (see  [30]), 


fv{y)  = 


0  elsewhere 


(4.3.15) 


where  p  is  the  force  per  unit  length  and  a  is  the  halfwidth  of  the  contact  region. 
For  p  =  3  and  a  —  1  the  resulting  stress  distribution  is  shown  in  Figure  4.3.  This 
distribution  is  then  convolved  with  the  convolution  kernel  kx  relating  normal  stress 


to  strain  which  is  given  by, 


K(lJ  ~  2/o ) 


3  x(x2  -  (y  -  y0)2) 
27 tE  ( x 2  +  (y  -  j/o)2)2  ’ 


(4.3,16) 


where  the  modulus  of  elasticity  is  E  and  Poisson’s  ratio  for  the  material  is  assumed 
to  be  0.5.  Figure  4.4  is  a  plot  of  the  convolution  kernel  for  E  =  1  and  x  =  1.  The 
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Figure  4.3:  Stress  distribution  due  to  cylindrical  object  applied  to  surface  of  com¬ 
pliant  material  (P  —  3  and  a  —  l). 

resulting  ‘strain’  (see  Figure  4.5)  is  then  provided  as  input  to  the  network  which 
then  attempts  to  reconstruct  the  surface  stress  distribution  of  Figure  4.4. 

The  time  evolution  of  three  outputs  of  a  41  node  network  (using  strain  data 
which  has  not  been  corrupted  with  noise)  is  shown  in  Figure  4.6.  It  is  observed 
that  all  three  outputs  rapidly  converge  to  values  which  are  very  close  to  the  known 
solutions  (shown  by  the  horizontal  lines). 

Accurate  asessments  of  convergence  time  could  not  easily  be  made  using  digital 
computer  simulations.  If  the  signal  plane  capacitors  were  assigned  values  so  as  to 
reduce  convergence  time  (by  decreasing  the  RC  time  constant)  numerical  instability 
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kx(y-y0) 


y-y0 

Figure  4.4:  Elastic  convolution  kernel  (for  x  =  1  and  E  —  l), 

resulted.  By  decreasing  the  integration  time  step  the  problems  with  instability 
could  be  avoided  at  the  cost  of  tremendous  increases  in  the  time  necessary  to 
perform  a  simulation;  so  it  was  merely  observed  at  this  stage  that  the  network 
does  converge. 

To  evaluate  performance  of  the  network  in  the  presence  of  noise,  Gaussian  white 
noise  with  variance  a2  was  added  to  the  strain  from  which  surface  stress  was  to 
be  determined.  Figure  4.7  shows  the  reconstruction  of  surface  stress  obtained  by 
the  network  from  noisy  data  ( a 2  =  0.1)  with  A  =  0  i.e.  without  any  regulariza¬ 
tion.  It  can  be  seen  that  the  reconstruction  is  very  poor  due  to  multiple  peaks  and 
negative  solutions.  However,  in  comparison  to  the  reconstruction  obtained  under 
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Figure  4.5:  Strain  (at  depth  x  =  l)  resulting  from  cylindrical  indentation  of  elastic 
half-space, 

identical  conditions  using  the  Discrete  Fourier  Transform  (see  Figure  4.8)  the  net¬ 
work  solution  is  markedly  superior.  As  the  regularizing  parameter  A  is  varied  (see 
Figures  4.9-4.10),  varying  degrees  of  positivity  and  smoothness  are  imposed  upon 
the  solution.  For  A  =  0.1  (Figure  4.9)it  is  clear  that  although  the  solution  has  been 
constrained  to  the  positive  orthant,  the  degree  of  regularization  is  insufficient  for 
the  given  noise  conditions.  In  Figure  4.10  (A  =  100)  the  solution  has  been  over¬ 
regularized  since  solutions  which  should  have  been  close  to  zero  have  been  pushed 
away  from  zero  and  the  peak  of  the  distribution  has  been  greatly  suppressed.  Fig¬ 
ure  4.11  shows  reconstruction  obtained  using  A  =  10  which  is  the  ‘best’  of  the  three 
shown. 
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Figure  4.6:  Time  evolution  of  three  outputs  ((1)V 2i,(3)V^  and  (5)V\)  of  a  deconvo¬ 
lution  network  with  fl  output  nodes.  Corresponding  known  solutions  (fV2i,fv9  and 
fvl)  are  shown  by  the  horizontal  lines  (2),  (4),  and  (6). 


Clearly,  for  given  noise  conditions,  there  exists  a  value  of  the  regularizing  pa¬ 
rameter  A  for  which  the  reconstruction  is  optimal  in  some  sense.  Since  the  objective 
is  to  match  the  reconstruction  to  the  original  stress  distribution  as  closely  as  pos¬ 
sible,  it  is  reasonable  to  choose  a  value  of  A  for  which  the  discrepancy  between 
the  original  stress  distribution  and  the  network  reconstruction  is  minimized.  One 
measure  of  the  discrepancy  between  the  known  solution  and  the  network  solution 
is  the  mean  squared  error  which  can  be  written  as, 

MSB  =  4||/;  -  /.It  U'3'U) 
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Figure  4.8:  Reconstruction  of  surface  stress  from  noisy  strain  data  using  the  DFT 

approach  (l)Dcsigned  surface  stress,  (2) Reconstruction. 
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Figure  4.9:  Network  reconstruction  of  surface  stress  from  noisy  strain  data  for 
A  =  0.1  f ^Reconstruction,  (2)Designed  surface  stress. 


2 . 4 
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Figure  4.10:  Network  reconstruction  of  surface  stress  from  noisy  strain  data  for 

A  =  100  (1) Reconstruction,  (2) Designed  surface  stress. 


A  =  10  (1) Reconstruction,  (2)Designed  surface  sti-ess. 

where  /°  £  is  the  known  solution  to  the  inverse  problem  and  /„  6  is  the 
solution  obtained  by  the  network.  Such  an  approach  to  the  problem  of  choosing  A 
suggests  the  use  of  design  tools  such  as  CONSOLE  (see  [30])  which  is  a  computer 
aided  design  tool  for  parametric  optimization  of  dynamical  systems4. 


4CONSOLE  was  used  to  optimize  parameters  in  design  of  the  breadboard  prototype  network 
which  was  constructed 
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CHAPTER 

FIVE 


ANALOG  HARDWARE  IMPLEMENTATION 


In  order  to  realize  the  full  computational  power  of  neural  networks  it  is  im¬ 
portant  to  consider  implementation  of  neural  networks  as  analog  electrical  circuits. 
The  inverse  problem  considered  here  is  among  a  class  of  problems  which  can  in  fact 
be  solved  using  more  traditional  digital  computation.  However,  traditional  digital 
computation  is  likely  to  fall  short  of  the  needs  of  tactile  sensing  since  the  digital 
hardware  necessary  to  achieve  the  desired  speed  of  computation  is  not  likely  to  lend 
itself  to  placement  in  close  proximity  to  the  sensors.  It  is  empirically  demonstrated 
in  this  chapter  that  analog  neural  networks  can  achieve  the  required  speed  while 
limiting  hardware  requirements  to  as  little  as  a  single  integrated  circuit  chip.  This 
also  permits  dedication  of  a  single  analog  processor  chip  to  each  array  of  sensors 
thereby  distributing  the  processing.  Although  the  current  state  of  VLSI  technology 
does  not  allow  for  implementation  of  extremely  large  networks,  there  is  still  a  class 
of  useful  networks  that  can  be  implemented.  The  network  we  consider  here  for  the 
solution  of  the  inverse  problem  of  tactile  sensing  is  among  those  networks  for  which 
currently  implementable  sizes  are  sufficient  to  prove  useful. 

Integrated  circuit  implementation  of  neural  network  models  is  an  area  of  re¬ 
search  which  has  recently  inspired  the  interest  of  a  significant  community  of  re- 
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Figure  4.12:  Reconstructions  of  surface  stress  as  A  is  varied.  The  dashed  curve 
represents  the  actual  surface  stress.  As  A  increases,  the  resulting  reconstructions 
(solid  curves)  have  increasingly  suppressed  peaks  and  move  further  into  the  positive 
orthant 
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searchers.  Large  multilayered,  fully  connected  networks  require  a  large  number  of 
weighted  connections.  Even  if  these  connections  are  implemented  by  simple  resis¬ 
tors,  physical  size  of  the  chip  and  achievable  density  of  circuitry  become  limiting 
factors  as  the  size  of  the  network  becomes  large.  Current  microelectronic  tech¬ 
nology  is  essentially  two  dimensional  and  thus  there  arises  the  (graph  theoretic) 
problem  of  laying  out  the  conductive  paths  necessary  to  make  the  required  connec¬ 
tions.  These  types  of  problems  have  generated  interest  in  optical  implementations 
of  neural  networks  (see  [31]).  Synaptic  connections  in  an  optical  neural  network  can 
be  defined  in  three  dimensions  and  weights  can  be  set  by  modulating  the  intensity 
of  light  over  a  region  in  space,  and  stored  using  holographic  techniques.  However, 
optical  technology  is  still  in  its  infancy  although  it  may  prove  practical  in  the  fu¬ 
ture.  ‘Learning’  in  neural  networks  requires  the  ability  to  change  the  weight  of 
any  connection  in  the  network.  In  [32]  a  VLSI  implementation  a  neural  network  is 
presented  in  which  the  synapses  are  programmable,  but  are  restricted  to  be  either 
excitatory  (+1),  inhibitory  (-1)  or  inert  (0).  Although  the  implementation  in  [32]  is 
a  significant  contribution  to  the  development  of  neural  network  implementations, 
the  limitations  of  current  technology  are  evident.  The  network  in  [32]  (among  of 
the  largest  implemented  networks  to  date)  contains  almost  3000  programmable 
connections  and  utilizes  a  2.5  /im  technology,  but  almost  90%  of  the  6.7  x  6.7 
mm  chip  is  occupied  by  the  synapses.  In  general,  it  would  be  desirable  to  create 
synapses  with  weights  which  are  continuously  variable  through  zero  and  can  extend 
significantly  into  both  the  positive  and  negative  orthants.  In  [33],  an  approach  to 
such  continuously  programmable  synapses  is  taken  in  which  MOS  transistors  are 
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employed  as  analog  multipliers  to  effect  the  multiplication  required  of  a  synapse. 
A  number  of  researchers  are  also  exploring  the  development  of  new  and  different 
integrated  circuit  technologies  which  are  better  suited  to  neural  network  imple¬ 
mentations.  The  primary  focus  of  research  into  implementations  of  analog  neural 
networks  is  the  design  of  dense  arrays  of  programmable  synapses.  Programmability 
of  the  connection  weights,  in  the  context  of  the  inverse  problem  considered  here, 
will  permit  the  use  of  different  convolution  kernels  in  the  deconvolution  network. 
Hence  variations  due  to  changes  in  sensor  response  etc.  can  be  accommodated  if 
the  new  convolution  kernel  can  be  determined. 

In  Section  5.1  some  experimental  results  from  a  breadboard  prototype  of  the 
network  described  in  Chapter  4  are  discussed  and  Section  5.2  describes  an  initial 
approach  to  integrated  circuit  implementation. 

5.1  Breadboard  Prototype 

As  discussed  in  Chapter  4,  accurate  assesments  of  convergence  time  for  the  network 
are  not  easily  made  using  digital  computer  simulations.  Also,  in  the  analysis  of  the 
deconvolution  network  in  Chapter  3,  it  was  assumed  that  any  dynamics  associated 
with  the  constraint  plane  could  be  ignored  provided  that  the  signal  plane  amplifiers 
are  sufficiently  slower  in  response.  In  practical  implementations  of  such  a  network, 
it  is  necessary  to  understand  what  effects  delays  in  the  constraint  plane  response 
may  have  upon  the  network.  It  is  ultimately  the  constraint  plane  dynamics  which 
limit  the  speed  of  convergence  which  is  achievable.  A  formal  treatment  of  this  sub¬ 
ject  is  to  be  found  in  Marcus  and  Westervelt  [34].  A  prototype  breadboard  model 
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Figure  5.1:  Schematic  circuit  diagram  of  a  single  signal  plane  node 

of  the  deconvolution  network  was  constructed  using  ‘off-the-shelf’  operational  am¬ 
plifiers,  resistors  and  capacitors.  The  network  was  constructed  with  seven  signal 
plane  nodes  and  seven  constraint  plane  nodes.  Since  the  purpose  of  constructing 
the  breadboard  prototype  was  to  estimate  to  some  extent  the  speed  achievable  by 
such  a  network,  exponential  amplifiers  in  the  signal  plane  were  replaced  by  unity 
gain  linear  amplifiers  to  simplify  the  circuit l.  Replacing  the  exponential  amplifiers 
by  linear  amplifiers  results  in  the  entropy  regularizer  being  replaced  by  a  regularizer 
of  the  form, 

MW  =  iX>?-  (5.1.1) 

t 

In  Figure  5.1  a  circuit  diagram  of  a  single  signal  plane  node  is  shown.  Each 

signal  plane  node  consists  of  two  stages  of  amplification.  Associated  with  the  first 

stage  is  the  feedback  capacitor  C,  which  introduces  the  relevant  network  dynamics, 
*A  second  breadboard  prototype  was  also  constructed  which  contained  the  exponential  am¬ 
plifiers,  but  was  used  to  solve  a  different  problem  (see  [36]). 
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Figure  5.2:  Schematic  circuit  diagram  of  a  single  constraint  plane  node 
and  the  feedback  resistor  R ,  which  introduces  and  weights  (A  =  l/R)  the  regular¬ 
izing  term  in  the  energy  function.  The  second  stage  of  each  signal  plane  node  is 
configured  as  an  analog  inverter.  Negative  connection  weights  are  implemented 
simply  by  using  the  inverted  output  of  the  node.  Figure  5.2  shows  a  single  con¬ 
straint  plane  node.  Each  constraint  plane  node  also  consists  of  two  stages.  The  first 
stage  is  configured  as  a  virtual  ground  transimpedance  amplifier,  which  provides 
the  feedback  gain  for  signals  fed  back  to  the  signal  plane.  As  in  the  signal  plane, 
the  second  stage  of  each  constraint  plane  node  is  an  analog  inverter. 

The  interconnection  matrix  [T^]  was  chosen  as  a  seven  point  discretization  of 
the  elastic  kernel  &”(•)  in  equation  (4.3.16)  and  implemented  using  resistors  with 
values  R{j  —  l/Tp.  Thus  if  the  voltage  output  Vk  of  node  k  is  connected  to  the 
input  of  node  j  through  a  resistor  Rkj  then  current  input  to  node  j  due  to  node  k 
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is  given  by  Ohm’s  Law  as, 

*>* - i;  f5-1-2* 

which  is  as  desired. 

Inputs  to  the  network  (currents  injected  into  the  constraint  plane)  were  chosen 
to  represent  samples  of  the  strain  distribution  due  to  the  compressive  loading  profile 
used  for  the  simulations  in  Chapter  4.  In  Figure  5.3  photographs  of  the  7-Channel 
breadboard  prototype  of  the  deconvolution  net  work  and  the  experimental  setup 
used  for  testing  it  axe  shown.  Input  currents  to  the  network  are  clocked  using  a 
1  kHz  relaxation  oscillator  so  as  observe  transients  (as  the  network  evolves)  using 
an  oscilloscope.  Outputs  of  the  network  are  captured  by  a  MetaResearch  data 
acquisition  board  used  in  conjunction  with  a  Macintosh  Plus  computer  and  the 
resultant  reconstruction  is  plotted  on  the  Macintosh  display. 

The  rise  time  of  the  constraint  plane  amplifiers  was  measured  to  be  approxi¬ 
mately  1  /usee.  Actual  response  time  of  the  constraint  plane  would  be  longer  than 
this  since  the  parallel  combination  of  all  resistors  connected  to  the  input  of  any 
node  contribute  to  the  EC  time  constant.  It  was  observed  that  for  choices  of  the 
signal  plane  capacitors  C  for  which  the  rise  time  of  the  outputs  of  the  network 
would  be  below  10  /usee,  the  outputs  would  oscillate  i.e.  the  network  was  unstable. 
For  (7=10  pF  the  rise  time  of  the  outputs  of  the  network  was  measured  to  be  10 
/usee  (see  Figure  5.4).  It  is  clear  that  the  use  of  faster  operational  amplifiers  would 
result  in  an  increase  in  achievable  speed  since  this  would  decrease  the  constraint 
plane  response  time  and  thereby  permit  a  decrease  in  the  time  constant  of  the 
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Figure  5.3:  Top:  Photograph  of  7-Channel  breadboard  prototype  of  deconvolution 
network  Bottom:  Photograph  of  experimental  setup  used  to  test  the  deconvolution 
network. 
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Figure  5.4:  Photographs  of  oscilloscope  traces  showing.  Top:  time  evolution  of  a 
single  output  of  the  signal  plane,  and  Bottom:  time  evolution  of  a  single  output  of 
the  constraint  plant,  tor  Liu  1 -channel  breadboard  prototype,  deconvolution  network. 


Figure  5.5:  Reconstruction  of  surface  stress  by  breadboard  prototype  deconvolution 
network.  Top:  Reconstruction  prior  to  settling  of  outputs,  Bottom:  Final  recon¬ 
struction. 
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signal  plane. 

Settling  time  and  overshoot  of  the  outputs  of  the  network  are  controlled  by  the 
gain  of  the  constraint  plane  nodes.  CONSOLE  (see  [30])  was  used  to  choose  a  value 
for  the  gain  so  as  to  minimize  overshoot  and  settling  time. 

5.2  Integrated  Circuit  Prototype 

A  prototype  analog  integrated  circuit  implementation  of  the  deconvolution  net¬ 
work  described  here  has  been  fabricated,  but  remains  to  be  tested.  A  hierarchical 
design  philosophy  is  practiced  in  this  initial  implementation.  The  deconvolution 
network  may  be  thought  of  as  composed  of  two  sections:  (i)  Active  components 
of  the  network  including  signal  and  constraint  plane  amplifiers  and  (ii)  The  func¬ 
tionally  passive2  resistive  interconnection  matrix.  These  two  sections  may  also  be 
thought  of  in  the  following  manner.  Once  the  size  of  the  deconvolution  network 
(number  of  inputs  and  outputs)  has  been  decided,  the  amplifiers  of  the  network  are 
determined.  However,  the  resistive  matrix  may  be  a  variable  entity.  For  instance, 
given  two  different  elastic  materials  (or  even  two  different  thicknesses  of  a  given 
elastic  material),  the  convolution  kernel  &”(•)  and  hence  its  discretization  [T)j]  are 
in  general  different.  Thus  the  network  may  be  thought  of  as  being  composed  of  a 
fixed  part  and  a  variable  part. 

If  fixed  resistors  are  to  be  used  to  implement  the  interconnect  matrix,  then 
2The  term  “functionally  passive’  here  is  used  to  describe  the  fact  that  in  some  situations  it 
is  desirable  to  use  active  circuit  components  configured  to  look  like  a  passive  resistor  from  an 
input/output  standpoint. 
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Internal  Wire  Bonds 


Bonding  Pads 


Figure  5.6:  Resistive  matrix  chip  is  placed  directly  atop  the  amplifier  chip  and 
connected  to  it  using  local  bonding 

some  provision  should  be  made  to  change  this  matrix  without  having  to  refabricate 
the  rest  of  the  network.  In  order  to  provide  some  flexibility  in  the  choice  of  the 
interconnect  matrix  and  to  permit  the  use  of  two  different  fabrication  technologies 
(as  discussed  in  Section  6.2.3),  the  deconvolution  network  was  fabricated  as  two 
separate  integrated  circuits. 

The  amplifier  chip,  which  is  described  in  Section  6.2.2,  is  designed  to  serve  as  the 
‘motherboard’  for  the  network  on  top  of  which  the  resistive  connection  matrix  chip 
(which  is  described  in  section  6.2.3)  is  placed  (see  Figure  5.6).  Connections  between 
the  two  chips  are  made  by  local  wire  bonds  between  bonding  pads  provided  for  this 
purpose  on  both  chips.  This  approach  also  facilitates  experimentation  with  different 
types  of  connection  matrices  such  as  those  with  programmable  connections. 
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5.2.1  The  Amplifier, Chip 


Figure  5.7  shows  the  layout  of  the  amplifier  section  of  the  deconvolution  network. 
This  chip  provides  the  signal  plane  and  constraint  plane  amplifiers  for  a  deconvolu¬ 
tion  network  with  eleven  input  nodes  and  eleven  output  nodes.  Fabrication  of  the 
amplifier  section  of  the  deconvolution  network  was  undertaken  through  the  MOSIS 
facility,  using  a  two  metal  layer  CMOS  p-well  technology  with  a  minimum  feature 
size  of  3/um.  The  size  of  the  amplifier  chip  is  approximately  6300^m  x  8800/im. 
In  the  center  of  the  amplifier  chip,  a  4000/xm  x  3200 fim  space  has  been  provided 
for  placement  of  the  second  chip  containing  the  resistive  connection  matrix.  It  is 
actually  not  necessary  to  leave  this  space  since  the  chip  is  passivated,  but  for  an 
initial  implementation,  it  was  provided. 

A  total  of  forty  four  operational  amplifiers  are  implemented  on  this  chip;  twenty 
two  for  the  signal  plane  and  twenty  two  for  the  constraint  plane.  Each  signal  plane 
node  and  constraint  plane  node  is  identical  in  structure  to  the  signal  and  constraint 
plane  nodes  used  for  the  breadboard  prototype.  Both  the  inverted  and  noninverted 
outputs  of  the  signed  and  constraint  plane  nodes  are  connected  to  bonding  pads 
adjacent  to  the  area  where  the  resistive  matrix  is  to  be  placed.  Thus  provision  is 
made  for  the  implementation  of  positive  and  negative  connection  weights.  Outputs 
and  inputs  of  both  the  signal  plane  nodes  and  constraint  plane  nodes  are  connected 
to  bonding  pads  located  on  the  periphery  of  the  die  to  permit  access  to  these  nodes 
after  the  chip  has  been  packaged.  Packaging  of  the  chip  requires  a  package  with  a 
minimum  of  fifty  pins,  if  the  inputs  and  outputs  all  nodes  are  to  be  accessible. 
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Figure  5.7:  Layout  of  integrated  circuit  chip  containing  all  amplifiers  for  an  eleven 
channel  deconvolution  network 
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Figure  5.8:  Left:  Layout  of  signal  plane  amplifier;  Right:  Layout  of  constraint  plane 
amplifier. 
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The  basic  building  block  of  the  amplifier  chip  is  the  operational  amplifier.  Fig¬ 
ure  5.9  shows  a  circuit  schematic  of  the  operational  amplifier  designed  for  this 
purpose.  Design  of  the  amplifier  takes  into  consideration  the  current  driving  re¬ 
quirements  for  use  in  the  deconvolution  network.  Inputs  on  each  amplifier  are  diode 
protected- against  spikes.  A  source  follower  output  stage  is  used  on  every  amplifier 
so  as  to  meet  the  current  driving  requirements  while  maintaining  relatively  low 
circuit  complexity  (compared  to,  for  example  a  push-pull  output  stage). 

As  can  be  seen  from  Figure  5.9,  an  internal  compensation  capacitor  of  5pf  is 
required  for  every  operational  amplifier.  In  addition  to  this,  external  feedback 
capacitors  of  about  lOpF  are  required  at  the  first  stage  of  every  signal  plane  node. 
Since  the  technology  available  did  not  include  an  additional  electrode  layer  (i.e.  a 
second  layer  of  polysilicon  to  be  used  in  forming  capacitors),  all  capacitors  in  the 
network  are  formed  as  parallel  plate  capacitors  with  polysilicon  as  one  electrode  and 
the  first  metal  layer  as  the  second  electrode.  Capacitance  between  the  first  metal 
layer  and  polysilicon  is  approximately  a  factor  of  eight  less  than  the  capacitance 
between  polysilicon  and  an  electrode  layer.  Hence  about  eight  times  as  much  area 
is  used  to  form  the  capacitors,  compared  to  capacitors  formed  using  an  electrode 
layer.  Since  about  half  of  the  area  used  by  constraint  plane  nodes  and  two  thirds 
of  the  area  used  by  the  signal  plane  nodes  is  used  to  form  the  capacitors,  about 
seven  twelveths  of  the  total  area  used  by  the  amplifiers  is  taken  up  by  capacitors. 
Hence  the  use  of  a  separate  electrode  layer  would  result  in  about  a  50%  reduction 
in  space  utilization.  Therefore  a  network  with  twenty  two  inputs  and  twenty  two 
outputs  is  a  trivial  extension  of  the  current  structure  (which  includes  the  space 
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provided  for  the  resistive  matrix)  using  a  comparable  3 fxm  technology. 

5.2.2  The  Resistive  Interconnection  Matrix  Chip 

In  any  implementation  of  a  matrix  of  resistive  elements,  it  is  necessary  to  consider 
the  effects  of  process  induced  variations  on  the  performance  of  the  network  with 
which  it  is  to  be  used.  Current  integrated  circuit  process  technology  may  introduce 
variations  in  the  value  of  any  given  resistor  as  large  as  20%.  It  must  either  be 
established  that  degradation  in  performance  of  the  network  due  to  such  variations 
is  irrelevant,  or  an  approach  to  implementation  must  be  taken  which  preserves  the 
essential  characteristics  of  the  interconnection  matrix  in  the  face  of  process  induced 
variations.  The  latter  approach  was  taken  in  this  implementation. 

Since  in  the  case  of  the  deconvolution  network,  the  resistive  interconnection 
matrix  is  a  discretization  of  the  elastic  convolution  kernel,  it  is  reasonable  to  try 
to  preserve  the  shape  of  the  connectivity  profile.  That  is,  it  is  the  ratiometric 
relationship  of  the  connective  weights  that  is  truely  important  since  any  other 
variations  correspond  to  a  simple  scaling  of  the  inputs  or  outputs.  In  an  attempt 
to  preserve  the  shape  of  the  connectivity  profile  in  the  face  of  processing  induced 
variations,  the  connective  weights  are  quantized.  Figure  5.10  shows  the  layout  of 
the  resistive  matrix.  A  discrete  resistive  element  is  formed  by  etching  a  5 fim  x  5 fim 
opening  in  the  oxide  layer  between  two  metal  wires  and  then  evaporating  silicon 
into  the  opening  (see  Figure  5.11).  Amorphous  silicon,  therefore  forms  the  resistive 
material.  Each  such  discrete  resistor  has  a  resistance  value  that  depends  on  the 
thickness  of  the  oxide  layer  and  the  area  of  the  opening.  For  the  oxide  thickness 
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Figure  5.10:  Layout  of  resistive  interconnection  matrix  chip. 
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Figure  5.11:  A  discrete  resistive  element  as  used  in  the  resistive  matrix  chip. 

used  (and  5 pm  x  5 pm  openings)  the  value  of  a  single  discrete  resistor  was  measured 
to  be  approximately  35KQ.  A  wire  grid  is  formed  using  the  first  metal  layer  to 
construct  parallel  wires  in  one  direction  and  the  second  metal  layer  to  construct 
wires  in  a  orthogonal  direction  (to  the  wires  formed  in  the  first  metal  layer).  At 
each  point  on  the  grid,  a  connective  weight  is  defined  by  placing  a  number  of 
discrete  resistors  of  the  type  described  at  that  point.  The  strength  of  connection 
at  any  point  on  the  grid  is  determined  by  the  number  of  discrete  resistors  placed 
there.  Ratiometric  relationships  between  elements  of  the  connection  matrix  should 
for  the  most  part  be  preserved  in  this  approach  since  the  ratio  of  two  connective 
weights  is  primarily  determined  by  the  ratio  of  the  number  of  discrete  resistors 
forming  each  connection  Any  variations  in  the  ratios  are  the  effects  of  phenomena 
such  as  nonuniform  oxide  thickness  over  the  area  of  the  die,  nonuniform  deposition 
of  amorphous  silicon  which  are  relatively  small  effects. 
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CHAPTER 

_ SIX 

CONCLUSIONS  AND  FUTURE  DIRECTIONS 


In  this  thesis,  we  considered  the  inverse  problem  of  recovering  stress  distribu¬ 
tions  over  regions  of  contact  from  samples  of  strain  provided  by  an  array  of  tactile 
sensors.  In  the  case  where  stress  is  applied  to  the  surface  of  a  compliant  material 
and  strain  is  measured  at  a  fixed  depth  beneath  the  surface,  the  inverse  problem 
was  shown  to  be  a  problem  of  deconvolution.  It  was  shown  in  Chapter  2  that  this 
inverse  problem  of  tactile  sensing  is  ill-posed  in  the  sense  defined  by  Hadamard. 
Also  in  Chapter  2,  it  was  shown  that  the  technique  of  regularization  could  be  used 
to  introduce  a  priori  knowledge  into  the  problem  in  order  to  obtain  solutions.  The 
constraints  of  non-negativity  and  smoothness  were  imposed  by  choosing  an  en¬ 
tropy  regularizer  for  the  recovery  of  normal  surface  stress.  Solutions  to  the  inverse 
problem  could  then  be  obtained  by  minimization  of  a  cost  functional. 

In  Chapter  3,  it  was  shown  that  under  certain  hypotheses,  energy  functions  may 
be  explicitly  determined  for  nonlinear  RC  electrical  networks.  Stable  equilibria  of 
the  network  were  shown  to  correspond  to  local  minima  of  the  energy  function  and 
conditions  for  stability  of  the  network  were  determined. 

An  analog  neural  network  for  regularized  solution  of  the  inverse  problem  was 
presented  in  Chapter  4.  Using  the  results  of  Chapter  3,  it  was  shown  that  the  energy 
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function  of  the  proposed  network  corresponds  to  the  variational  principle  described 
in  Chapter  2  for  solution  of  the  inverse  problem  of  tactile  sensing.  Stability  of 
the  network,  in  terms  of  electrical  circuit  analysis,  was  shown  to  be  guaranteed 
by  monotonicity  of  the  characteristics  of  the  signal  plane  amplifiers.  It  was  also 
determined  that  any  regularizer  which  could  be  written  in  the  form  of  equation 
(4.2.14)  and  satisfied  the  monotonicity  requirements  for  the  signal  plane  amplifier 
characteristic  g,  could  be  incorporated  into  the  energy  function  for  the  network. 
Computer  simulations  demonstrated  the  ability  of  the  deconvolution  network  to 
accurately  recover  normal  surface  stress  even  when  the  sensor  outputs  were  severely 
corrupted  by  noise. 

A  breadboard  prototype  of  the  deconvolution  network  was  used  to  demonstrate 
the  computational  speed  achievable  by  such  a  hardware  implementation.  Con¬ 
vergence  time  for  the  breadboard  prototype  was  measured  to  be  approximately 
10/isec.  An  integrated  circuit  implementation  of  the  proposed  deconvolution  net¬ 
work  was  undertaken.  An  hierarchical  approach  was  taken  in  the  integrated  circuit 
implementation  to  provide  flexibility  in  the  design  of  an  interconnection  matrix. 

Several  aspects  of  tactile  sensing  are  of  a  very  similar  nature  as  problems  en¬ 
countered  in  computational  vision.  Essential  differences  lie  in  the  fact  that  unlike 
vision  sensors,  which  are  remotely  located  with  respect  to  their  target,  tactile  sen¬ 
sors  are  required  to  be  in  physical  contact  with  their  targets.  The  ‘deblurring’ 
problem  considered  in  this  thesis  is  an  example  of  a  problem  which  also  arises  in 
computational  vision.  Edge  detection  is  used  in  vision  to  determine  boundaries 
within  the  visual  field  and  is  required  in  tactile  sensing  to  identify  physical  edges 
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of  objects,  and  locate  holes  to  determine  shape.  A  similar  analogy  can  be  drawn 
between  motion  detection  in  vision  and  identifying  slippage  in  tactile  sensing. 

The  set  of  processes  that  recover  physical  attributes  of  visible  three  dimensional 
objects  from  two-dimensional  visual  (intensity)  images,  is  collectively  termed  as 
early  vision.  In  a  similar  manner,  the  elements  of  the  second  hierarchical  level  of 
tactile  sensing  (described  in  Chapter  1)  may  be  collectively  termed  early  touch. 
Early  touch  then  can  be  defined  as  the  set  of  processes  that  recover  the  three- 
dimensional  attributes  of  an  object  and  properties  of  the  established  contact,  from 
two-dimensional  arrays  of  sensor  measurements.  As  yet,  the  set  of  problems  of 
which  early  touch  is  comprised  have  not  been  clearly  defined,  but  recovery  of  stress 
over  the  contact  region,  edge  detection,  and  identification  of  slippage  are  likely 
among  them. 

Most  early  vision  problems  are  ill- posed  and  regularization  has  successfully  been 
applied  in  solving  many  of  them  (see  e.g.  [35]  and  [3]).  However,  regularization 
as  a  technique  for  solving  ill-posed  problems  has  also  demonstrated  limitations. 
Among  the  limitations  of  standard  regularization  theory,  is  its  inability  to  effec¬ 
tively  cope  with  discontinuities  [36].  If  the  operators  K  and  P  in  equation  (2.6.24) 
are  linear  (as  in  standard  regularization  theory),  the  solution  space  is  essentially 
restricted  to  generalized  splines.  Hence  in  some  cases  the  resultant  solutions  will 
be  overly  smooth  and  cannot  be  trusted  at  discontinuities  such  as  edges.  It  has 
been  suggested  [37]  that  after  standard  regularization,  locations  where  the  resul¬ 
tant  solution  originates  a  ‘large’  error  in  the  second  (regularizing)  term  of  equation 
(2.6.24)  can  be  identified  as  locations  of  discontinuities.  A  second  second  stage 
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of  regularization  can  then  be  performed,  using  the  locations  of  discontinuities  as 
boundary  conditions.  This  approach  requires  the  choice  of  a  threshold  for  error  in 
the  regularizing  term  in  order  to  identify  discontinuities.  Furthermore  the  task  of 
locating  discontinuities  is  hindered  by  the  smoothing  due  to  the  first  stage  of  reg¬ 
ularization.  Marroquin  in  [38]  proposes  a  nonquadratic  stabilizer  which  preserves 
discontinuities  in  reconstructions  of  surfaces  from  depth  data  by  embedding  prior 
knowledge  about  the  geometry  of  discontinuities. 

Another  approach  to  overcome  some  of  the  limitations  of  standard  regulariza¬ 
tion  theory,  is  based  on  Bayesian  estimation  and  Markov  random  field  models  of  the 
image  (tactile  or  visual).  This  approach  (see  [36]  uses  prior  knowledge  represented 
in  terms  of  appropriate  probability  distributions  instead  of  directly  restricting  the 
solution  space.  It  can  be  shown  [39]  (see  also  [36]  and  [35]),  that  maximizing 
the  a  posteriori  probability,  is  equivalent  to  minimizing  an  expression  of  the  form 
(2.6.24).  However,  in  this  case  the  functional  to  be  minimized  is  not  in  general 
quadratic  as  a  whole. 

It  is  clear  that  the  class  of  problems  for  which  quadratic  variational  princi¬ 
ples  are  sufficient,  is  limited.  For  every  quadratic  variational  principle,  it  can  be 
shown  that  there  exists  a  corresponding  linear  analog  electrical  network  consisting 
of  resistors,  voltage  sources,  and  current  sources,  which  has  the  same  solutions. 
This  fact  is  used  in  [3]  to  synthesize  analog  resistive  networks  to  solve  problems 
in  early  vision.  In  [40],  the  approach  taken  to  solving  a  nonquadratic  variational 
principle  employs  a  hybrid  analog-digital  network  which  at  each  iteration  (on  the 
digital  time  scale)  solves  a  quadratic  variational  principle  (in  analog).  In  general, 
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nonquadratic  variational  principles  may  posses  numerous  local  minima  in  addition 
to  a  global  minimum  (which  may  or  may  not  exist).  The  deterministic  gradient 
descent  approach  taken  in  [40]  demonstrated  an  ability  to  perform  well  (qualita¬ 
tively)  in  comparison  to  statistical  annealing  which  converges  to  the  global  solution 
with  probability  one  if  appropriately  applied.  However,  convergence  to  the  global 
solution  cannot  be  guaranteed  and  the  hybrid  analog-digital  nature  of  the  network 
introduces  additional  hardware  complexity. 

In  this  thesis,  it  was  shown  that  a  strictly  analog  network  can  be  structered  so  as 
to  solve  a  nonquadratic  variational  principle.  Convergence  to  the  global  solution,  in 
this  case,  is  guaranteed  since  the  variational  principle  of  (2.7.27)  is  strictly  convex. 
It  was  also  shown  that  there  exists  a  class  of  non  quadratic  variational  principles  (see 
equation  (4.2.14  ))  which  can  be  solved  by  similar  networks  by  choosing  appropriate 
characteristics  g(-)  for  the  signal  plane  amplifiers.  In  the  case  of  this  larger  class 
of  nonquadratic  variational  principles,  convergence  to  global  solutions  cannot  in 
general  be  guaranteed  since  multiple  minima  may  exist.  Techniques  such  as  adding 
noise  to  the  network  can  be  used  to  aid  in  escaping  from  local  minima  in  an  attempt 
to  find  the  global  solution.  Two  interesting  questions  arise  in  this  context:  (i)For 
any  nonquadratic  variational  principle,  under  what  conditions  does  there  exist  a 
(possibly  nonlinear  and  active)  analog  network  with  the  same  solution?  and  (ii)How 
can  optimality  of  the  solutions  be  guaranteed? 

So  far  the  discussion  in  this  thesis  has  been  confined  primarily  to  the  problems 
of  early  touch.  The  highest  level  of  the  tactile  sensing  hierarchy,  which  has  been 
ignored  so  far,  is  crucial  to  the  usefulness  of  any  tactile  sensing  system.  A  higher 


85 


level  description  of  the  tactile  environ  environment  is  the  next  step  beyond  the  low- 
level  description  provided  by  the  processes  of  early  touch.  For  instance,  although 
a  low-level  description  of  a  grasped  object  may  be  sufficient  to  secure  the  object  in 
a  stable  grasp,  it  is  not  adequate  to  directly  identify  the  object.  In  this  aspect  of 
tactile  sensing  as  well,  neural  networks  may  provide  a  solution.  Adaptive  neural 
networks  have  demonstrated  a  remarkable  ability  to  ‘learn’  complex  representations 
and  successfully  classify  patterns  based  on  these  representations.  Among  other 
applications  of  such  neural  network  classifiers  are  handwritten  character  recognition 
[32],  identification  of  faces  [31],  and  classification  of  superposed  radar  return  signals 
[?].  Successful  VLSI  implementation  of  an  adaptive  neural  network  classifier  for 
recognition  of  grasped  objects  may  further  reduce  the  computational  load  of  the 
central  processor. 

It  was  shown  by  construction  of  a  breadboard  circuit  (in  Chapter  5)  that  analog 
hardware  implementation  of  the  network  proposed  in  Chapter  4  leads  to  conver¬ 
gence  times  in  the  order  of  10/isec.  In  order  to  compare  this  with  digital  compu¬ 
tation,  we  note  that  simple  inversion  of  an  n  x  n  symmetric  Toplitz  matrix,  is  of 
computational  complexity  0(n2).  For  the  purpose  of  a  biased  comparison  (biased 
in  favor  of  digital  computation),  we  can  ignore  the  regularizer  and  assume  that  it 
takes  exactly  n  operations  to  invert  the  matrix  A.  Then  for  a  modest  array  of  25 
tactile  sensors,  it  would  be  necessary  to  perform, 

o  operations  ,  solutions  Million  operations 

252'  -  -  x  105 - 7-  =  62.5 - - 

solution  second  second 

in  order  to  keep  up  with  the  processing  speed  of  the  analog  network.  This  is  clearly 
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not  possible  for  local  digital  computation.  Also  as  the  size  of  the  sensor  array  in¬ 
creases,  the  processing  time  required  for  digital  computation  increases  quadraticaly. 
It  was  noted  in  Chapter  4  that  in  the  case  of  the  analog  network,  convergence  time 
actually  decreases  as  the  size  of  the  problem  increases.  In  [2]  it  was  established 
through  survey  that  processing  times  of  approximately  l-2ms  are  sufficiently  fast 
for  most  tactile  sensing  applications.  Additional  available  time  resulting  from  the 
use  of  analog  network  processors  could  be  utilized  to  perform  other  tasks  such  as 
predicting  slippage  based  upon  prior  and  current  processing  results.  An  initial  ap¬ 
proach  to  analog  integrated  circuit  implementation  of  the  deconvolution  network 
was  also  discussed  in  Chapter  5.  A  hierarchical  design  strategy  was  employed  to 
permit  some  flexibility  in  the  pattern  of  interconnections  in  the  network.  Mating 
of  the  amplifier  chip  and  resistive  matrix  chip  in  the  manner  discussed  in  Chap¬ 
ter  5  is  not  however  optimal  and  this  approach  was  only  used  for  experimental 
purposes.  Technologies  such  as  ‘Indium  bump’  bonding  are  more  suitable  for  such 
a  design  and  would  permit  greater  density  of  circuitry  on  both  the  amplifier  and 
resistive  matrix  chips.  In  this  technology,  the  two  chips  would  also  be  stacked  one 
on  top  of  the  other,  but  connections  between  the  two  chips  would  be  made  by 
placing  small  ‘bumps’  of  Indium  in  corresponding  locations  on  both  chips  and  then 
heating  the  whole  structure  which  results  in  the  Indium  melting  and  alignment  of 
corresponding  bumps  on  both  chips  due  to  surface  tension. 

It  is  clear  that  there  remain  a  great  many  unaddressed  and  unsolved  problems 
in  the  area  of  tactile  sensing.  Tactile  sensing  has  not  received  the  attention  of 
researchers  to  the  same  extent  as  vision  has.  As  in  vision,  there  is  a  need  in  tactile 
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sensing  to  identify  and  formalize  the  problems  involved  and  then  devise  sensible 
solutions  to  these  problems.  We  have  suggested  in  this  thesis  that  analog  neu¬ 
ral  networks  may  provide  a  solution  to  some  of  the  low-level  processing  aspects  of 
tactile  sensing  and  possibly  even  some  of  the  higher  level  tasks  such  as  object  recog¬ 
nition.  However,  we  have  only  addressed  a  small  portion  of  a  large  and  complex 
problem. 
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